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ABSTRACT 


Title  of  Dissertation:  Nonholonomic  Geometry,  Mechanics  and  Control 
Rui  Yang,  Doctor  of  Philosophy,  1992 
Dissertation  directed  by:  P.  S.  Krishnaprasad,  Professor 

Department  of  Electrical  Engineering 

This  dissertation  is  concerned  with  dynamic  modeling  and  kinematic  control 
of  constrained  mechanical  systems  with  symmetry  from  a  geometric  point  of  view. 
Constraints  are  defined  via  the  characteristics  of  distributions  or  codistributions  on  the 
tangent  bundle  (velocity  phase  space)  of  configuration  space.  Lie  symmetry  groups 
acting  on  the  systems  are  assumed  to  leave  both  Lagrangian  and  constraints  invariant. 
As  a  special  case  of  mechanical  systems  with  holonomic  constraints,  we  rigorously 
analyze  the  kinematics  and  dynamics  of  floating,  planar  four-bar  linkages.  The  analyses 
include  topological  description  of  the  configuration  space,  symplectic  and  Poisson 
reductions  of  the  dynamics  and  bifurcation  of  relative  equilibria.  For  kinematic  control  of 
nonholonomic  systems,  we  mainly  study  the  related  optimal  control  problem  for  a  system 
consisting  of  a  rigid  body  with  two  oscillators.  In  particular,  the  intrinsic  formulation  and 
explicit  solvability  of  necessary  conditions  for  the  optimal  control  are  investigated  from 
a  Hamiltonian  point  of  view.  In  the  study  of  the  dynamics  of  Lagrangian  systems  with 
constraints,  the  nonholonomic  distributions  are  defined  via  arbitrary  choices  of  principal 
connections.  We  show  that,  under  our  hypotheses  on  constraints  and  exterior  force, 
the  dynamics  of  a  nonholonomic  Lagrangian  system  with  non- Abelian  symmetry  can  be 
reduced  to  a  lower  dimensional  space  determined  by  the  principal  fiber  bundle.  The 
reduced  dynamic  equations  are  formulated  explicitly.  This  formulation  generalizes  the 
one  for  classical  Chaplygin  systems  which  possess  Abelian  symmetry,  and  the  one  having 
non-Abelian  symmetry  but  with  linear  constraints.  In  addition,  if  a  special  principal 
connection,  that  is,  the  mechanical  connection  by  Rummer  and  Smale,  is  considered, 
our  formulation  for  nonholonomic  systems  also  leads  to  the  one  in  Lagrangian  reduction 
discovered  recently  by  Marsden  and  Scheurle.  The  results  of  this  dissertation  have  direct 
application  in  space  robotics  and  nonholonomic  motion  planning  in  robotics. 
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CHAPTER  I 


INTRODUCTION 


The  motions  of  various  mechanical  systems  which  we  wish  to  synthesize  and  control 
often  have  to  satisfy  certain  kinds  of  restrictions  imposed  by  the  natural  environment 
or  the  structure  of  the  systems  themselves.  In  mechanics,  such  restrictions  are  called 
constraints.  Although  the  fundamental  theory  of  mechanical  systems  with  constraints 
was  established  and  developed  in  the  last  century,  recent  research  and  developments 
in  analytical  mechanics  and  control  theory  from  a  geometric  viewpoint  have  inspired  a 
strong  desire  to  reinterpret  and  reformulate  the  theory  of  constrained  dynamics  in  an 
intrinsic  geometric  way.  In  addition,  many  practical  problems  in  recent  investigations 
in  mechanical  and  electrical  engineering,  such  as  modeling  and  control  of  mobile  robots 
and  dextrous  robotic  hands,  and  the  design  and  control  of  spacecraft,  also  show  the  need 
for  a  deeper  understanding  of  the  role  that  constraints  play  in  mechanical  systems. 

In  classical  Lagrangian  mechanics,  constraints  are  usually  classified  into  two 
types:  holonomic  and  nonholonomic  constraints.  Roughly  speaking,  if  we  exclude  those 
constraints  which  are  given  by  a  set  of  inequalities  or  time  dependent  equations,  the 
former  imposes  a  limitation  on  the  motion  of  the  system  to  a  subspace  of  the  configuration 
space,  but  the  latter  imposes  a  limitation  on  the  motion  of  the  system  to  a  subspace  of 
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the  velocity  phase  space.  If  we  further  restrict  our  attention  to  constraints  which  can  be 
expressed  via  a  set  of  linear  functions  of  velocities  of  the  motion,  using  Frobenius’  theorem 
in  differential  geometry,  the  distinction  between  two  kinds  of  classical  constraints  can  be 
given  rigorously  via  integrability  of  corresponding  distributions.  From  this  point  of  view, 
a  holonomic  constraint  is  characterized  by  an  integrable  distribution,  and  the  motion  of 
the  system  under  study  is  restricted  to  a  leaf  of  the  corresponding  foliation.  Then,  using 
suitable  local  coordinates,  one  can  always  describe  the  motion  of  the  constrained  system 
by  a  set  of  unconstrained  dynamic  equations  on  a  lower  dimensional  space.  Because 
of  this,  in  analytical  mechanics,  establishing  the  dynamic  equations  and  determining 
the  motion  for  systems  with  holonomic  constraints  are  not  considered  as  challenging 
problems  in  general.  However,  from  a  practical  point  view,  it  is  always  of  interest  to 
understand  how  the  constraints  affect  the  complexity  of  dynamic  motions  of  a  specific 
system.  In  particular,  such  understanding  usually  generates  sophisticated  strategies  for 
design  and  control  of  the  system.  On  the  other  hand,  using  geometric  interpretations, 
classical  nonholonomic  constraints  can  be  characterized  by  nonintegrable  distributions  on 
the  configuration  space.  Then  the  real  motion  of  a  mechanical  system  with  nonholonomic 
constraints  must  satisfy  the  condition  that  the  tangent  vector  of  a  path  in  configuration 
space  belongs  to  such  distribution. 

Treating  the  classical  constraints  in  the  geometric  way  as  above,  one  can  think  of 
more  general  questions  related  to  the  theory  of  distribution  on  manifolds  and  various 
nonholonomic  problems.  For  instance,  it  is  known  that  the  motion  of  a  Lagrangian 
mechanical  system  can  be  described  completely  by  a  special  vector  field  on  the  tangent 
bundle  of  its  configuration  space  (i.e.,  via  second  order  differential  equations).  Therefore, 
from  a  geometric  point  of  view,  it  is  more  natural  to  consider  the  constraints  in  terms  of 
distributions  on  the  tangent  bundle.  In  doing  so,  certain  constraints  which  are  not 
linear  in  velocity  but,  for  example,  those  described  by  the  null  set  of  some  set  of 
functions  on  tangent  bundle,  can  also  be  given  geometric  interpretations.  Hence,  in 
a  certain  sense,  a  geometric  treatment  generalizes  the  range  of  constrained  problems  we 
can  deal  with.  A  natural  question  that  follows  is  that  of  reformulating  the  Lagrange- 
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d’Alembert  principle  in  an  intrinsic  form.  In  [46,47],  this  and  related  issues  are  carefully 
studied.  There  is  another  viewpoint  in  seeking  a  generalization  of  constrained  problems 
in  mechanics.  Notice  that  the  concept  of  distribution  in  differential  geometry  is  concerned 
only  with  the  mathematical  meaning  of  a  constraint.  In  dealing  with  mechanics,  classical 
constraints  (e.g.,  geometric  and  kinematic  constraints)  are  just  one  means  to  define 
or  construct  related  distributions.  This  suggests  that  one  can  think  of  other  kinds  of 
constraints  coming  from,  for  instance,  conservation  laws  as  a  source  of  distributions.  In 
[5],  the  former  are  called  phenomenological  constraints,  the  latter  are  called  symmetry 
constraints. 

In  mathematics,  general  nonholonomic  problems  are  those  of  determining  a  class  of 
curves  in  terms  of  a  certain  law  or  principle  or  criterion  such  that  it  satisfies  restrictions 
given  by  distributions  on  a  given  space.  From  this  point  of  view,  it  is  clear  that  the 
curves  determined  from  constrained  classical  mechanical  systems  satisfy  the  Lagrange- 
d’Alembert  principle.  One  can  certainly  consider  such  problems  in  terms  of  other  criteria 
or  principle,  for  instance,  Hamilton’s  principle  of  least  action.  Of  course,  it  has  been 
known  that  the  latter  does  not  give  real  motions  for  classical  mechanical  systems  if 
the  constraints  are  nonholonomic.  Recent  research  in  this  direction  leads  to  certain 
nonstandard  problems  of  geodesics  [48].  In  [8]  these  are  referred  to  as  problems  of 
singular  Riemannian  geometry  and  Strichartz  speaks  of  sub-Riemannian  geometry  [45]. 
In  mechanics,  this  problem  is  closely  related  to  the  problem  of  nonholonomic  motion 
planning  or  kinematic  control  [22,23,31]. 

This  dissertation  is  concerned  with  both  types  of  problems  with  constraints 
mentioned  above.  The  main  feature  here  is  that  we  focus  on  systems  which  possess  group 
symmetries.  The  fundamental  mathematical  tools  applied  here  are  those  which  have  been 
widely  used  in  geometric  mechanics,  such  as  Lie  groups  and  Lie  algebras,  symplectic  and 
Poisson  geometry,  Lagrangian  and  Hamiltonian  mechanics,  reductions  and  connection 
theory  on  principal  fiber  bundles.  This  dissertation  is  organized  as  follows. 

In  Chapter  II,  after  reviewing  some  definitions,  notations  and  important  theorems 
in  differential  geometry  and  geometric  mechanics,  we  introduce  an  invariant/intrinsic 
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formulation  of  Lagrangian  mechanics  due  to  Vershik  [46]  and  Vershik  and  Faddeev  [47]. 
Such  a  formulation  makes  it  possible  for  us  to  restate  Lagrange-d’Alembert  principle 
with  constraints  on  second  tangent  bundle  using  the  concept  of  virtual  displacement  in 
classical  mechanics.  It  is  easy  to  show  that  this  formulation  is  equivalent  to  the  one 
using  constraint  reaction  force  considered  in  [46,47].  But,  our  expression  of  Lagrange- 
d’Alembert  principle  for  constrained  systems  turns  out  to  be  very  important  in  Chapter 
V,  in  which  symmetries  are  associated  to  the  systems  under  study. 

In  Chapter  III,  we  study  the  kinematics  and  dynamics  of  the  simplest  coupled 
mechanical  systems  with  holonomic  constraints,  that  is  the  planar,  floating  four-bar 
linkages.  Although  a  floating  four-bar  linkage  is  a  particular  mechanism  with  holonomic 
constraint,  it  illustrates  precisely  what  we  mentioned  earlier,  that  is,  its  dynamic 
properties  in  comparison  with  an  open  chain  are  very  different.  This  chapter  provides 
complete  analyses  including  topological  description  of  the  configuration  space,  symplectic 
and  Poisson  reductions  of  the  dynamics  and  bifurcation  of  relative  equilibria. 

Chapter  IV  addresses  reduction  and  explicit  solvability  of  optimal  control  problems 
on  principal  bundles  with  connections  from  a  Hamiltonian  point  of  view.  The  particular 
mechanical  system  we  consider  is  a  rigid  body  with  two  oscillators.  The  optimal  control 
problem  is  posed  by  considering  a  special  nonholonomic  variational  problem,  in  which 
the  nonholonomic  distribution  is  defined  via  a  connection.  The  necessary  conditions  for 
the  optimal  control  problem  are  determined  intrinsically  by  a  perturbation  method  and 
a  Hamiltonian  formulation.  The  necessary  conditions  admit  the  structure  group  of  the 
principal  bundle  as  a  symmetry  group  of  the  system.  Thus  the  problem  is  amenable 
to  Poisson  reduction.  Under  suitable  hypotheses  and  approximations,  we  find  that  the 
reduced  system  possesses  additional  Abelian  symmetry.  Applying  Poisson  reduction 
again,  we  obtain  a  further  reduced  system  and  corresponding  first  integral.  The  model 
problem  described  here  is  strongly  motivated  by  a  troublesome  phenomenon  of  drift 
observed  in  the  Hubble  Space  Telescope  due  to  thermo-elastically  driven  vibrations 
of  the  solar  panels  arising  from  the  day-night  thermal  cycling  on-orbit.  The  point 
mass  oscillators  in  our  problem  may  be  viewed  as  one-mode  truncations  of  the  elasto- 
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mechanical  problem. 

The  most  important  contribution  of  this  dissertation  to  nonholonomic  Lagrangian 
mechanics  is  contained  in  Chapter  V,  in  which  we  discover  a  reduction  theorem  for  the 
Lagrangian  systems  with  non- Abelian  symmetries  and  certain  nonholonomic  constraints. 
This  chapter  is  motivated  by  Chaplygin’s  reduction  formulation  in  classical  mechanics 
with  Abelian  symmetry.  It  is  known  that,  in  general  for  a  system  with  nonholonomic 
constraints,  one  cannot  get  reduced  dynamics  on  a  lower  dimensional  space  by  elimi¬ 
nating  constraints,  as  one  can  do  for  holonomic  constraints.  Instead,  one  usually  has  to 
expand  the  space  by  bringing  in  more  variables,  i.e.,  Lagrange  multipliers.  But  Chap¬ 
lygin  showed  that  one  can  do  such  reduction  if  the  system  and  constraints  admit  an 
Abelian  symmetry.  A  natural  question  one  can  ask  is  if  this  is  also  possible  for  con¬ 
strained  systems  which  admit  a  non-Abelian  symmetry?  This  question  is  answered  in 
some  detail  in  this  chapter.  The  constraints  here  are  constructed  on  principal  fiber  bun¬ 
dles  with  connections.  Under  our  assumption,  they  take  the  form  of  affine  functions 
in  velocity,  instead  of  linear  ones.  In  using  distribution  theory  to  interpret  constraints, 
one  has  to  consider  the  distributions  on  the  tangent  bundle  of  the  configuration  space, 
rather  than  on  configuration  space.  The  approach  of  constructing  constrained  dynamics 
on  the  second  tangent  bundle  introduced  in  Chapter  II  can  be  directly  applied  here. 
Since  we  consider  constraints  from  principal  connections  which  decompose  the  velocity 
phase  space,  the  dynamics  of  the  system  can  be  described  on  a  horizontal  subspace  and, 
consequently,  on  the  tangent  bundle  of  the  quotient  space  of  the  configuration  space 
with  respect  to  the  symmetry  group.  Then  our  goal  to  reduce  the  dynamics  on  a  lower 
dimensional  space  can  be  reached.  Following  this  idea,  we  obtain  a  reduction  theorem. 
An  important  application  of  our  reduction  theorem  is  the  derivation  of  Lagrangian  re¬ 
duction,  where  the  principal  connection  used  is  the  mechanical  connection  determined 
by  the  conserved  momentum  map.  In  this  application,  our  result  coincides  with  the  one 
in  [27].  Some  representative  examples  are  also  studied  at  the  end  of  this  chapter. 

In  Chapter  VI,  we  reconsider  the  main  results  in  this  dissertation  and  point  out 
some  topics  for  future  research. 
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CHAPTER  II 


PRELIMINARIES 


In  this  chapter,  we  first  review  some  definitions,  notations  and  important  theorems 
in  differential  geometry  and  geometric  mechanics.  Concepts  and  results  that  will  be  used 
frequently  in  the  following  chapters  are  the  main  focus  of  this  chapter.  The  second  aim 
of  this  chapter  is  to  introduce  an  invariant  formulation  of  Lagrangian  mechanics  due 
to  Vershik  [46]  and  Vershik  and  Faddeev  [47].  In  particular,  we  are  interested  in  the 
statement  of  Lagrange-d’Alembert  principle  for  systems  with  constraints.  An  important 
feature  of  Section  2.2  is  that  we  work  with  constraints  on  vector  fields  that  behave  as 
virtual  displacements  in  classical  mechanics.  Such  a  presentation  leads  to  the  form  of 
Lagrange-d’Alembert  principle  expressed  in  terms  of  reaction  forces  due  to  constraints, 
which  is  the  form  given  in  [46,47].  Our  expression  of  Lagrange-d’Alembert  principle 
for  constrained  systems  will  play  a  crucial  role  in  Chapter  V,  where  symmetries  will  be 
associated  to  the  systems  under  study. 
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2.1  Geometric  Mechanics 

In  this  section,  we  recall  some  basic  definitions,  commonly  used  notations  and 
important  theorems  in  geometric  mechanics,  which  will  be  cited  in  the  following  chapters. 
The  useful  references  are  [1,2]. 

2.1.1  Lie  Groups  and  Group  Actions 

A  Lie  group  G  is  a  differentiable  manifold  and  a  group,  for  which  the  group 
operations,  product  and  inverse,  are  differentiable  maps.  We  denote  by  Rg  :  G 
G  :  h  i-»  hg  and  Lg  :  G  — ►  G  :  h  t->  gh  right  and  left  translation,  respectively.  One 
can  show  that  the  tangent  space  of  G  at  identity  e,  TeG ,  forms  a  Lie  algebra  which  is 
isomorphic  to  the  set  of  left  invariant  vector  fields  on  G  (see  [1],  Chapter  4).  We  denote 
by  Q  the  Lie  algebra  of  G.  The  Lie  bracket  of  f  and  r]  is  denoted  as  [  f,  V  ] ,  V£,  rj  £  Q. 

Let  Q  be  a  smooth  manifold.  A  (left)  action  of  a  Lie  group  G  on  Q  is  a  smooth 
mapping 

#  :  G  X  Q  -  Q  :  ( g,q )  ->  $(g,  q)  =  *g(q)  £ g-q  (2-1.1) 

such  that  4>(e,  <7)  =  q,  Vq  £  Q ,  and  <b(g,$(h,q))  =  §(gh,q),Vg,h  £  G,Vq  £  Q  .  Certain 
induced  actions  of  G  on  manifolds  M  are  of  particular  interest.  Let  M  =  TQ ,  the 
tangent  bundle  of  Q .  The  tangent  lift  action  of  G  on  TQ  is  given  by 

$T  :  G  X  TQ  -»  TQ  :  (g,  vq)  ~  (2.1.2) 

where  Tq$q  denotes  the  linearization  of  <&g  at  q  £  Q.  Let  M  =  T*Q ,  the  cotangent 
bundle  of  Q .  The  cotangent  lift  action  of  G  on  T*Q  is  given  by 

<J>T'  :  G  x  T*Q  T*Q  :  (<7,  aq)  (2-1-3) 

where  is  the  linear  dual  of  Tq$q.  Let  M  —  Q .  The  adjoint  action  of  G  on  Q  is 

defined  by 

Ad:G  xQ  -*  Q  :  (</,0  ~  Ad^  =  Te(Rg-.Lg)£.  (2.1.4) 

Let  M  =  Q* ,  the  dual  of  Lie  algebra  Q  of  G .  Then  the  co-adjoint  action  of  G  on  Q *  is 
defined  by 

Ad*  :  G  x  Q*  —>■  G*  :  (g,  u)  Ad (2.1.5) 
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where  the  operation  Ad*  is  given  by  (Ad*i/,r])  =  (l>,  AdgT]) ,  for  any  77  E  Q . 

Let  exp{t£)  be  the  integral  curve  of  the  left  invariant  vector  field  on  G  associated 
to  £  E  Q .  For  a  group  action  as  in  (2.1.1)  and  £  E  Q ,  the  vector  field  defined  by 


<*<»>  -  5 


$(exp(t£),q),  VqeQ 


t= 0 


(2.1.6) 


is  called  infinitesimal  generator  or  fundamental  vector  field  of  the  action  corresponding 
to  £.  One  can  show  that  if  the  group  G  acts  on  Q  freely  and  effectively,  the  mapping, 
£  t->-  £q  is  an  isomorphism  of  the  Lie  algebra  Q  into  the  Lie  algebra  2J(Q),  the  set  of  all 
smooth  vector  fields  on  Q  (cf.  [33]).  For  the  adjoint  action  defined  as  in  (2.1.4), 

£g(v)  =  “dtV£[£,V]  Vt 7  €  0.  (2.1.7) 


For  the  co-adjoint  action  defined  in  (2.1.5), 


&;*(<*)  =  —ad^a,  Va  G  Q* ■ 


(2.1.8) 


2.1.2  Hamiltonian  Systems 

We  first  consider  Hamiltonian  systems  on  a  symplectic  manifold.  A  symplectic 
manifold  (M,  fi)  is  an  even  dimensional,  smooth  manifold  M  together  with  a  closed, 
nondegenerate  two-form  ,  called  symplectic  structure ,  on  M .  Given  a  smooth  function 
H ,  called  the  Hamiltonian  or  energy  function ,  on  M ,  the  Hamiltonian  vector  field  Xh 
is  the  smooth  vector  field  on  M  satisfying 

n(XH,  Y)  =  dH  •  Y  'VFe  X(M).  (2.1.9) 

Let  Q  be  an  n -dimensional  smooth  manifold,  which  represents  the  configuration 
space  of  a  mechanical  system.  Let  M  =  T*Q  be  the  momentum  phase  space, 
coordinatized  locally  by  z  =  (q\,  ■  ■  •,  qn,Pu  •  •  •  ,pn)  ■  One  can  show  that  a  symplectic 
structure  on  T*Q  is  a  canonical  two- form  which  is  represented  locally  by  flo  = 
dqi  A  dpi  (cf.  Theorem  3.2.10  in  [1]).  Then  the  Hamiltonian  vector  field  is  given 
locally  by  Xh  =  A  VH ,  where  matrix  A  = 
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One  can  also  work  with  a  symplectic  structure  on  M  =  TQ ,  the  velocity 
phase  space.  Given  a  smooth  function  L ,  called  Lagrangian ,  on  TQ ,  the  Legendre 
transformation  is  defined  by  fiber  derivative  F L  :  TQ  — *■  T*Q  associated  to  L.  Let 
f 'll  =  (FL)*CIq.  One  can  show  that  if  L  is  regular,  D,l  is  symplectic.  Hence,  (TQ,  fijr,) 
is  a  symplectic  manifold.  Let  ( q,v )  €  TQ  and  the  energy  function 

Hl(q,  v)  =  F L(v)  ■  v  -  L(q ,  v). 

A  vector  field  XuL  ,  called  Lagrangian  vector  field ,  on  TQ  is  determined  by 

li(xHl  ,  y)  =  dHL  •  y,  w  e  %(tq).  (2.1.10) 

One  shows  that  if  L  is  hyperregular,  i.e.,  F L  is  a  diffeomorphism  and  the  Hamiltonian 
H  =  Hl  o  (FL)-1 ,  then  the  base  integral  curves  given  by  bundle  projections  of  the 
integral  curves  of  Xhl  and  Xu  on  Q  are  identical. 

We  next  consider  Hamiltonian  systems  defined  on  Poisson  manifolds.  A  Poisson 
manifold  (M,  {•,•})  is  a  smooth  manifold  M  together  with  an  3fJ-bilinear  map  on 
C°°(M ): 

:  C°°(M )  x  C°°(M)  -  C°°(M), 

referred  to  as  Poisson  structure  or  Poisson  bracket ,  which  satisfies  the  following  axioms: 
for  fi  e  C°°(M),i=  1,2,3, 

1)  Skew  Symmetry:  {/i,/2}  =  -{/2,/i}; 

2)  Leibniz’  Rule:  {/i,/2/3}  =  {/i,  /2}/3  +  M/1,/3}; 

3)  Jacobi  Identity:  {/1,  {/2,  /3}}  +  {/s,  {/1,  /2>}  +  {/j,  {/a,  fi}}  =  0. 

A  Poisson  structure  can  be  expressed  uniquely  through  a  contravariant  skew-symmetric 
two-tensor  A  on  M ,  called  Poisson  tensor ,  such  that 

{/,0}(2)  =  Hz){df(z),dg(z)),  Vz  6  M.  (2.1.11) 

If  M  is  an  n-dimensional  manifold  (n  is  finite),  A  is  given  by  an  n  X  n  skew-symmetric 
matrix,  also  denoted  as  A ,  and  the  Poisson  bracket  can  be  expressed  as 

{/,</}(*)  =  XfT(z)A(z)Vg(z).  (2.1.12) 
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Given  a  smooth  function  H ,  referred  to  as  Hamiltonian ,  on  M ,  the  Hamiltonian 
vector  field  on  (M,  {•,  •})  is  defined  by  the  relation 

XH(f)  =  {f,H}.  (2.1.13) 

The  uniqueness  of  such  a  vector  field  can  be  proved,  cf.  [30].  Let  4>(t)  be  the  flow  of 
Xh  and  Z  be  a  coordinate  function,  we  then  have 

(2.1.14) 

In  the  finite  dimensional  case,  we  have 

z  =  A(z)VH(z),  (2.1.15) 

where  z(t)  =  Z(4>(t)).  A  function  C  on  M  is  a  Casimir  function  if  {C,  F}  —  0  for  all 
F  €  C°°(M),  i.e.,  C  is  a  constant  along  the  flow  of  all  Hamiltonian  vector  fields. 
Remark  2.1.1:  Every  symplectic  manifold  is  a  Poisson  manifold  by  defining  the 

Poisson  structure  to  be  {/,  g}  =  Cl(Xf,Xg).  It  is  a  theorem  of  Lie  and  Kirillov 

that,  in  general,  Poisson  manifolds  can  be  decomposed  into  the  symplectic  leaves  of 
a  generalized  foliation  [25].  As  an  example,  the  Lie-Berezin-Kirillov-Kostant-Souriau 
Poisson  structures  on  the  dual  space  of  Lie  algebra  Q ,  Q* ,  is  considered.  One  can  show 
that  the  symplectic  leaf  through  each  point  p  in  Q*  is  the  coadjoint  orbit  through  [i 
(cf.  [21]).  | 

2.1.3  Symmetry  and  Reduction 

Let  (M,{  ,  })  be  a  Poisson  manifold  and  let  Lie  group  G  act  on  (M,  {  ,  }) 
canonically,  that  is,  for  each  g  G  G,  the  map  4>9  :  M  — *  M  preserves  the  Poisson 
structure,  i.e., 

{/i,/2}  o  $3  =  {/1  o  $„/2  O  €  C°°(M).  (2.1.16) 

A  momentum  mapping,  J  :  M  — *•  Q*  of  the  action  is  defined  by 

<j(z),0  =  J(0(*),  (2-1-17) 
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for  all  £  £  Q  and  z  6  M ,  where  a  linear  map  J  :  Q  — ►  C°°(M)  is  assumed  to  exist  such 
that  Xj(£)  =  £jvf ,  or  from  (2.1.13), 

{/,  /({)}  =  £m(/),  v/  e  (2.1.18) 

A  momentum  map  J  is  called  Ad*  - equivariant  if  J  o  =  Ad*_jJ,  where  Ad*  is  the 
co-adjoint  action  defined  in  (2.1.15). 

It  is  well-known  that,  if  the  Hamiltonian  H  is  G -invariant,  the  momentum  map 
provides  a  first  integral  for  the  system.  This  result  is  known  as  Noether’s  theorem  (in 
Hamiltonian  version)  which  is  stated  below. 

Theorem  2.1.2:  If  the  Lie  group,  G ,  acting  canonically  on  the  Poisson  manifold 
M  admits  a  momentum  mapping  J  :  M  — ►  Q*  and  H  6  C°°(M)  is  G-invariant,  i.e., 
H  o  =  H  or  £m{H)  =  0  for  all  £  6  Q ,  then  J  is  a  constant  of  the  motion  for  H ,  i.e., 
J  o  <f>t  =  J ,  where  <j>t  is  the  flow  of  Xh  ■  I 

Next,  we  introduce  the  general  framework  of  Poisson  reduction.  Let  a  Lie  group  G 
acts  on  Poisson  manifold  (M,{  ,  }m)  freely  and  effectively.  Let  tt  :  M  — ►  M/G  be  the 
canonical  projection.  Here  we  assume  the  quotient  space  M/G  is  a  smooth  manifold. 
(M,  {  ,  }m)  is  Poisson  reducible  if  M/G  has  a  Poisson  structure  {  ,  }m/G  which  satisfies 

{/O  7T,/lO  7r}M  =  h}M/G  O  7T,  (2.1.19) 

for  f,h  G  C°°(M/G).  If  the  Hamiltonian  H  on  M  is  G-invariant,  the  reduced 
Hamiltonian  H  on  M/G  is  defined  by 

H  o  tt  =  H.  (2.1.20) 

In  addition,  the  Poisson  reduced  Hamiltonian  vector  field,  Xfj  is  given  by 

XhU)  =  {/,  V/  e  C°°(M/G).  (2.1.21) 

Remark  2.1.3:  Another  point  of  view  of  reduction  in  geometric  mechanics  is  the 
symplectic  reduction.  For  the  detail  of  its  framework,  cf.  [1,28],  and  for  an  application, 
tf-  [43]-  I 


11 


2.1.4  Simple  Mechanical  Systems  with  Symmetry 

A  simple  mechanical  system  with  symmetry  is  a  4-tuple  (Q,  K,  V,G),  where, 

(i)  Q  is  a  smooth  manifold,  the  configuration  space  of  the  system; 

(ii)  K  is  Riemannian  metric  on  Q ,  whose  value  on  TqQ  x  TqQ ,  for  each  q  G  Q ,  is 
written  as  K(q)(vq,wq),  Vvq,wq  G  TqQ .  The  kinetic  energy  of  the  system  is  then 
written  as  \  K(q)(vq,  vq) ; 

(iii)  V  is  a  function  on  Q  ,  the  potential  energy; 

(iv)  G  is  a  connected  Lie  group  with  an  action  $  :  G  X  Q  — *  Q  which  leaves  both  the 
Riemannian  metric  and  the  potential  energy  invariant,  i.e.,  for  each  q  in  Q  and  g 
in  G, 

I 

K(*g(l))(Tq*q  -vq,Tq$q  -wq)  =  K(q)(vq,wq),  Vvq,wqeTqQ  (2.1.22) 
and  V(S,(?))  =  V(q). 

For  a  given  simple  mechanical  system  with  symmetry  (Q ,  K,V,G) ,  the  associated 
Lagrangian  is  defined  by 

L:TQ^U:  vq  ~  L(vq)  =  ^K{q)(vq,  vq)  -  V  o  r(vq),  (2.1.23) 

where  r  :  TQ  -»•  Q  is  the  canonical  tangent  projection.  The  Legendre  transform  F L  of 
L  is  given  here  by  the  vector  bundle  isomorphism 

Kb  :  TQ  -»  T'Q  (2.1.24) 

satisfying  Kb(vq)  •  wq  =  K(q)(vq,wq),Vvq,  wq  G  TqQ.  The  Hamiltonian  H  :  T*Q  —*■  K 
is  defined  to  be 

H(aq)  =  l-K(q)((KY\^q),(I^)-\aq))  +  VoT*(aq\  (2.1.25) 

where  r*  :  T*Q  — ►  Q  is  the  canonical  cotangent  projection.  The  standard  momentum 

map  can  be  defined  either  by  J  :  T*Q  — ►  Q *  for  the  lifted  action  4>T  given  in  (2.1.3) 
such  that,  for  each  f  G  Q  and  aq  G  TqQ , 

(J(a?),0  =  (a„fQ(?)>,  (2.1.26) 
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or  by  J11  :  TQ  — »  Q*  for  the  lifted  action  <1>T  given  in  (2.1.4)  such  that,  for  each  f  £  Q 


and  vq  G  TqQ  , 


=  K(q)(vg,£Q(q)). 


(2.1.27) 


One  can  show  that  both  momentum  maps  are  Ad*-equivariant.  (In  the  above  equations 
and  later,  we  use  {•,  •)  to  denote  the  natural  pairing  of  the  elements  in  a  vector  space 
and  its  dual.  We  sometimes  also  use  to  indicate  this  pairing.) 

Remark  2.1.4:  Addition  of  linear  in  velocity  term  to  the  Lagrangian  of  a  simple 
mechanical  system  with  symmetry  gives  rise  to  gyroscopic  systems  with  symmetry.  For 
a  careful  study  of  such  systems,  see  [49].  ■ 


2.2  Lagrangian  Mechanics  with  Constraints 

In  this  section  we  present  Lagrange- d’Alembert  principle  for  Lagrangian  systems 
with  constraints  in  invariant  form.  We  first  recall  some  important  definitions  and  facts. 
The  proofs  for  most  of  those  facts  can  be  found  in  [46,47]  and  will  be  omitted  here. 
Although  our  treatment  follows  these  references,  one  should  notice  the  differences  of 
certain  sign  conventions.  In  [50]  the  intrinsic  Lagrangian  viewpoint  is  exploited  in  the 
study  of  gyroscopic  stabilization. 

Let  Q  be  a  smooth,  n -dimensional  manifold  and  TQ  the  tangent  bundle  of  Q 
with  the  canonical  projection  7r  :  TQ  —>  Q.  Let  TTQ  be  the  second  tangent  bundle 
of  Q  and  TV  :  TTQ  —*■  TQ  be  the  tangent  map  of  n.  In  local  coordinates,  one  has 
7r  :  (g,  v)  q  at  each  point  (q,v)  in  TQ  and 

TV(q,v)  ■  T{q,v)TQ  -*■  TqQ 


(2.2.1) 


(u,  w)  I~>  ( q,u ), 


where  u,w  G  TqQ  . 


Let  3S(TQ)  be  the  set  of  vector  fields  on  TQ .  A  vector  field,  X  £  X(TQ ),  is 
special  if  TV  o  X  is  identity.  In  local  coordinates,  at  each  point  (g,u)  G  TQ , 


X(q,v)=  (v,w),  (2.2.2) 

for  some  w  G  TqQ .  It  is  clear  that  a  special  vector  field  defines  a  differential  equation  of 
second  order. 
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Let  T^q  v)TQ  be  a  subsPace  of  T(q,v)TQ  such  that  each  vector  in  T^^TQ  is 
tangent  to  the  fiber  of  TQ .  In  local  coordinates,  every  vector  in  T^q  vyCQ  has  the  form 
of  (0,  w)  for  some  w  €  TqQ  .  This  subspace  is  referred  to  as  vertical  tangent  subspace.  It 
follows  that  one  can  identify  TqQ  and  T^q  v)TQ  by  isomorphism  through  the  mapping 


7(9, v)  '•  TqQ  ►  T(q,v)TQ 


w  i-*-  (0,  w). 


(2.2.3) 


A  vector  field  Xv  £  X(TQ)  is  vertical  if,  at  each  point  (q,v)  in  TQ ,  Xv(q,v)  £ 
TV  ,TQ.  A  vector  field  XpT  €  X(TQ)  is  principal  if  it  is  vertical  and,  at  each  point 
(q,v)  in  TQ ,  Xpr(q,v)  =  7(9)„)  •  v,  or  explicitly,  Xpr(q,v)  =  (0,v).  Thus,  there  is  a 


unique  such  vector  field. 

We  next  consider  the  corresponding  geometric  objects  and  maps  on  the  dual  spaces. 
Let  T*Q  and  T*TQ  be  the  cotangent  bundle  of  Q  and  TQ ,  respectively.  Let  tu1(TQ) 
be  the  totality  of  one- forms  on  TQ .  A  1-form  u  6  ter  1(TQ)  is  horizontal  if  it  annihilates 
vertical  vector  fields  on  TQ .  In  local  coordinates,  (a,  13)  £  T*qvyTQ  for  a,/3  £  T*Q  is 
horizontal  if  and  only  if  (3  —  0.  Let  ^  :  T*Q  -*  T*q^TQ  be  the  linear  dual  of 

the  map  T7r(,iV)  given  in  (2.2.1).  If  a  £  T*Q ,  one  can  show  that 


(TK)*{q<v)(a)  =  (a,  0), 


(2.2.4) 


which  is  horizontal.  Indeed,  if  (u,w)  €  T(g>v)TQ, 


((Tir)fqtV)(a),(u,w))  =  (a,(2V)(,,w)(tt,ti>))  =  (a,u)  =  ((a,  0),  (u,  w)). 

Moreover,  let  j^v)  :  T^q  v)TQ  ->  T*Q  be  the  dual  of  the  map  7(,,v)  given  in  (2.2.3). 
One  can  show  that 

7  {q,v){a,P)  =  P,  (2.2.5) 

where  a, (3  £  T*Q.  Indeed,  (■y^q  v)(a,P),u)  =  {(a,0),~f(q,v)u)  =  ((Q^)) =  (P,u)- 
Define  a  bundle  map  r  :  T*TQ  — »•  T*TQ  by 


r  =  (T7t)*7*. 


(2.2.6) 


From  (2.2.4)  and  (2.2.5)  one  shows  that,  for  (a,0)  £  T*q  v)TQ , 

T(q,v){a,P)  =  (0,0), 


(2.2.6)' 
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which  is  horizontal.  (Indeed,  7-(?)„)(a,/3)  =  (TV) *7^  v^(a,(3)  =  (TV)*/?  =  (/3,0).) 
Remark  2.2.1:  It  should  be  kept  in  mind  that  the  horizontal/ vertical  operations 
defined  in  this  section  are  based  on  the  structure  of  the  second  tangent  bundle  of  a 
manifold.  They  are  different  from  the  horizontal/vertical  operations  on  the  principal 
fiber  bundle,  which  will  be  introduced  in  Chapter  IV.  | 

We  are  now  ready  to  define  some  geometric  objects  to  present  Lagrange- d’Alembert 
principle.  Let  the  smooth  manifold  Q  be  the  configuration  space  of  a  mechanical  system 
and  let  a  smooth  function  L  on  TQ  be  the  Lagrangian.  Define  a  horizontal  1-form  by 

ujl  =  r  o  dL,  (2.2.7) 

where  dL  :  TQ  — >  T*TQ  is  exterior  derivative  of  L.  In  local  coordinates,  dL(q,v )  = 
(D\L(q,  v),  D2L(q,  v)) ,  where  D,L(q,v )  is  the  Frechet  derivative  of  L  relative  to  z-th 
argument  of  L .  Then, 

uL(q,v)  =  (D2L(q,v),  0).  (2.2.7)' 

Define  a  2-form,  Lli,  by 

SlL  =  -  duL,  (2.2.8) 

where  du>L  is  the  exterior  derivative  of  u>i.  From  (2.2.7)'  it  is  easy  to  obtain  the 

expression  for  LIl  in  local  coordinates,  that  is, 

ttL(q,  v)((ui,  wi),  ( u2,w2 ))  =  (DiD2L{q,  v)  ■  u2)  ■  ux  +  ( D2D2L(q ,  v )  •  w2)  •  u2 

-  ( D2D2L(q,v )  •  Ui)  •  u2  (D2D2L{q,v)-  wQ  ■ u2 , 

(2.2.8)' 

where  (tz,-,u;,)  G  T(g<v)TQ,i  =  1,2.  Define  the  energy  function  Hl  on  TQ  by 

Hl  =  dL(Xpr)  -  L,  (2.2.9) 

which,  in  local  coordinates,  can  be  written  as 

^l(9,  n)  =  D2L(q,  v)  ■  v  -  L(q,  v).  (2.2.9)' 

Finally,  define  Lagrangian  force  on  a  special  vector  field  X ,  denoted  as  Fl(X),  by 

FL(X)  =  nL(X,-)-dHL.  (2.2.10) 
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One  can  show  that  Fl(X)  G  zvl(TQ)  is  horizontal  and,  from  (2.2.8)'  and  (2.2.9)',  its 
expression  in  local  coordinates  is,  for  X(q,v)  =  (v,w), 

FL(X){q,v)(u i,it2)  =  ( -D\D2L{q,v )  •  v  -  D2D2L(q,v)-w  +  DxL(q,v ))  •  ti1;  (2.2.10)' 
for  any  ( ux,u2 )  G  T{q,v)TQ  . 

Using  the  above  notions,  the  Lagrange-d’Alembert  principle  for  Lagrangian  system 
without  constraints  is  given  as  follows. 

Lagrange-d’Alembert  Principle  ( without  Constraints)  2.2.2: 

On  the  special  vector  field  which  determines  the  real  trajectory  of  motion,  the  sum 
of  the  Lagrangian  force  and  exterior  force  vanishes  on  any  vector  field  on  the  tangent 
bundle  of  configuration  space.  | 

More  explicitly,  this  principle  says  that  the  special  vector  field  X  whose  integral 
curves  are  the  trajectories  of  motion  in  TQ  satisfies 


{FL(X)  +  ue,Z)  =  0,  (2.2.11) 

for  any  vector  field  Z  G  X(TQ) ,  where  ue  is  a  horizontal  one-form  on  TQ  denoting  the 
exterior  force.  Following  (2.2.10)',  the  local  expression  of  (2.2.11)  can  be  given  as,  at 
each  point  (q,  u)  G  TQ  with  Z{q,  u)  =  ( u ,  w)  and  we(q,  v )  =  (ae,  0) , 

(~DiD2L(q,  v)  ■  v  -  D2D2L(q,  v)  ■  w  +  D\L(q,v )  +  ae)  ■  u  =  0 


or,  by  letting  v  =  q  and  w  =  v , 


(  —  D2L(q,v)~  D\L(q,v))  ■  u  =  ae  ■  u. 


(2.2.12) 


For  finite  dimensional  systems,  (2.2.12)  gives  the  classical  Euler- Lagrange  equation,  (in 
vector  form) 

A  r)T.  FIT. 

(2.2.13) 


=  aP 


dt  dq  dq 

We  next  introduce  an  important  mapping  and  two  lemmas  which  will  be  used  later. 
If  the  Lagrangian  L  is  regular,  one  shows  that  is  nondegenerate.  This  gives  rise  to 
a  well  defined  one-to-one  mapping,  11^  :  w1(TQ)  — *•  X{TQ),  determined  by 


fiL(nL(«),ir)  =  "(n 


(2.2.14) 
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for  any  one-form  u  and  any  vector  field  Y  on  TQ .  The  following  lemma  gives  an 
important  property  of  this  mapping  [47]. 

Lemma  2.2.3:  Let  L  be  regular.  If  u  is  horizontal  one-form  on  TQ  and  u  ^  0,  then 
Il£,(u;)  ^  0  and  II^u;)  is  a  vertical  vector  field  on  TQ .  I 

Define  the  Lagrangian  vector  field,  Xhl  ,  by 

XHl  =  n  L(dHL),  (2.2.15) 

where  Hl  is  the  energy  function  given  in  (2.2.9).  The  following  useful  lemma  is  also 
shown  in  [47]. 

Lemma  2.2.4:  Xhl  is  a  special  vector  field  on  TQ .  | 

Before  we  proceed  to  the  discussion  of  constrained  systems,  we  note: 

Remark  2.2.5:  One  can  show  that  the  geometric  objects  and  Xhl  here  are 

exactly  the  same  as  those  in  the  Section  2.2.1.  | 

Remark  2.2.6:  It  is  very  important  to  realize  that  (2.2.11)  means  that  the  work 
done  by  the  sum  of  Lagrangian  force  and  exterior  force  on  Z  is  zero.  This  work  is,  in 
fact,  virtual  work  and  Z  is  virtual  displacement  (on  TQ)  in  the  literature  of  classical 
mechanics.  Therefore  (2.2.11)  is  nothing  but  the  principle  of  virtual  work.  It  is  also 
important  to  note  that  when  the  motion  of  the  system  is  under  certain  constraints 
(holonomic  or  nonholonomic),  this  principle  is  also  true  provided  that  the  vector  field 
Z  satisfies  further  conditions  relating  to  the  constraints.  This  is  what  we  will  consider 
next.  | 

Before  defining  constraints,  we  first  give  our  definitions  of  a  distribution  and  its 
annihilator. 

Definition  2.2.7:  Let  M  be  a  smooth  manifold  and  TM  be  its  tangent  bundle  with 
projection  pr  :  TM  — *  M .  Let  M'  C  M  be  a  smooth  submanifold  of  M .  A  distribution 
D  of  TM  on  M'  is  a  subbundle  of  TM  over  M' ,  i.e.,  D  C  pr~l(M')  with  projection 
pr1  =  pr\o .  The  annihilator  of  D  ,  a  codistribution  on  M'  denoted  as  D x  ,  is  a  subbundle 
of  T*M  over  M'  such  that,  for  any  m  6  M' ,  uj  £  D ^  and  v  £  Dm,  {u>,v)  =  0.  | 

Remark  2.2.8:  We  make  two  assumptions: 

(1)  In  the  following,  for  simplicity,  we  assume  M'  =  M .  This  assumption  will  be  relaxed 
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later  (after  Remark  2.2.18)  when  we  consider  a  special  case  in  which  M'  is  a  leaf  of  a 
foliation  of  M .  In  this  particular  case,  D  =  TM'  and  distribution  and  co-distribution 
given  below  can  be  naturally  restricted  to  M' . 

(2)  We  assume  that  all  distributions  and  co-distributions  are  smooth  (i.e.,  differentially 
depend  on  the  points  of  M)  and  nonsingular  (i.e.,  have  fixed  dimension).  By  this 
assumption,  it  is  clear  that  if  dim(M)  =  n  and  dim(D)  =  d,  dim(D-L)  —  2n  —  d.  | 
We  now  introduce  Lagrangian  systems  with  constraint  on  an  n -dimensional 
configuration  space  Q.  Classically,  a  constraint  is  defined  by  a  subspace,  D',  of  TQ, 
which  is  locally  given  by  m  independent  functions,  say  <f>k(q ,  v ),  k  =  1,  •  •  • ,  m,  on  TQ  , 
i.e., 

D'  =  {( q ,  v)eTQ\  <f>k(q,  v)  =  0,  k  =  1,  •  •  • ,  m)  (2.2.16) 

If  we  further  assume  that  d>fc,s  are  linear  in  i),  fl'  is  a  distribution  of  TQ.  Then  a 
Lagrangian  system  compatible  with  the  constraint  is  one  such  that  at  any  point  q(t)  on 
the  trajectory  of  the  motion  {q(t),t  6  [0,T],T  >  0},  (q(t),v(t))  =  (q(t),q(t))  E  D' . 

However,  as  we  have  seen,  the  geometric  treatment  of  Lagrangian  mechanics  is  to 
construct  the  dynamics  of  the  system  on  the  second  tangent  bundle  of  Q .  Therefore, 
in  general,  constraint  should  be  defined  through  a  submanifold,  or  particularly,  a 
distribution  on  TQ.  (Caution:  This  is  still  a  special  case.  See  Remark  2.2.8  (1)). 
A  Lagrangian  system  compatible  with  constraint  will  mean  that  the  dynamics  (or  the 
special  vector  field)  belongs  to  this  distribution.  An  equivalent  way  to  define  the 
constraint  is  to  consider  it  as  a  codistribution  on  TQ  such  that  it  annihilates  the  special 
vector  fields  on  TQ  given  by  the  motion  of  the  system.  This  is  what  we  will  do. 
Definition  2.2.9:  A  Constraint  on  the  tangent  space  TQ  is  a  codistribution  0  on 
TQ .  A  Lagrangian  system  compatible  with  constraint  0  is  a  special  vector  field  X  on 
TQ,  which  is  annihilated  by  0  at  every  point  on  TQ,  i.e.,  0(A)  =  0.  | 

In  local  coordinates,  at  each  point  ( q,v )  in  TQ, 

Q(q,v)  =  span{9k{q ,  v)  —  (ak,flk),  k  =  1,  •  •  • ,  m),  (2.2.17) 

where  a^,/3'fe  e  T*Q,  k  —  1 ,  —  ,m,  and  {&k}  are  linearly  independent,  which  can, 

therefore,  be  viewed  as  a  basis  for  0  . 
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Using  the  bundle  mapping  r  defined  in  (2.2.6)  we  can  define  the  constraint 
reactions  on  TTQ  and  other  related  notions  as  follows. 

Definition  2.2.10:  The  constraint  reaction  is  the  horizontal  codistribution,  rQ.  An 
element  of  r0  is  referred  to  as  a  constraint  reaction  force.  A  constraint  is  admissible 
if  dim(rO)  =  dim(Q).  A  constraint  is  called  ideal  if  it  annihilates  the  principal  vector 
field  XPr.  1 

From  (2.2.17)  and  (2.2.6)',  in  local  coordinates,  a  constraint  is  admissible  if 
{/3k,  k  =  1,  •  •  • ,  m }  are  linearly  independent. 

The  following  two  lemmas  are  essential  and  easy  to  prove  [46]. 

Lemma  2.2.11:  If  a  constraint  is  admissible,  then  there  exists  a  special  vector  field 
compatible  with  this  constraint.  | 

Lemma  2.2.12:  If  a  constraint  is  ideal,  constraint  reaction  vanishes  on  loops  lifted  on 
TQ  from  Q.  I 

Remark  2.2.13:  The  conditions  of  these  two  lemmas  insure  the  existence  of  a  special 
vector  field  which  satisfies  the  constraint  and  the  workless  nature  of  constraint  reaction 
force.  Later  on  we  will  always  assume  these  conditions  are  satisfied.  | 

We  are  now  ready  to  state  the  Lagrange-d’ Alembert  principle  with  constraints. 
Lagrange-d’Alembert  Principle  with  Constraints  2.2.14: 

On  the  special  vector  field  which  satisfies  the  constraint  and  determines  the  real 
trajectory  of  motion,  the  sum  of  the  Lagrangian  force  and  exterior  force  vanishes  on 
the  vector  fields  annihilated  by  the  constraint  reaction  on  the  tangent  bundle  to  the 
configuration  space.  | 

Using  the  notations  introduced  earlier  in  this  section,  this  principle  says  that  the 
special  vector  field  X  which  gives  the  dynamics  of  the  system  with  constraint  satisfies, 

(f’i(I)  +  ue,Z)  =  0  (2.Q.18a) 

for  an  exterior  force  ue  and 

0(A)  =  0  (2.2.18b) 

for  any  vector  field  Z  such  that 

(r0)(Z)  =  0.  (2.2.18c) 


19 


Remark  2.2.15:  As  we  have  mentioned  in  Remark  2.2.6,  vector  field  Z  plays  the 
role  of  virtual  displacement.  Then  (2.2.18)  says  that  work  done  by  Lagrangian  force  and 
exterior  force  on  virtual  displacements  is  zero  provided  that  the  virtual  displacements 
are  “perpendicular”  to  the  the  constraint  reaction.  This  is  the  essence  of  Lagrange- 
d’Alembert  principle.  We  sometimes  also  call  the  value  of  Z  a  test  vector.  | 

Following  the  same  procedure  used  to  obtain  (2.2.12)  and  using  the  local  expression 
(2.2.17)  for  0,  (2.2.18)  can  be  represented  locally  as 

(^ DiL{q ,  v )  -  DiL(q,  v))  ■  u  =  ae  ■  u  (2.2.19a) 

with  constraints 

ak  ■  v  +  /3k  •  v  =  0,  &  =  l,---,m,  (2.2.19b) 

for  u  satisfying 

/3k-u  =  0  k  =  1,  •  •  •  ,m.  (2.2.19c) 

The  Lagrange-d’Alembert  principle  can  also  be  stated  in  an  alternative  form  by 
using  the  constraint  reaction  force.  From  (2.2.18)  we  see  that  Z  belongs  to  (r©)-1 
implies  that  Fl(X)  +  ue  belongs  to  r0,  i.e.,  there  is  an  element,  a  constraint  reaction 
force,  w  in  r0  such  that 

Fi,(A)  -f-  uie  -)-  w  =  0.  (2.2.20a) 

Then  the  Lagrange-d’Alembert  principle  says  that  the  special  vector  field  X  determines 
the  real  trajectory  of  motion  for  the  constrained  system  if  it  satisfies  (2.2.20a)  and 

©(A)  =  0.  (2.2.20b) 

The  local  expression  for  (2.2.20)  can  be  obtained  as  follows.  Let  {8k  =  ( ak,pk),k  = 
1,  •  •  • ,  to}  be  the  basis  for  0 .  Then 

771 

a;=^Afc(/3fc,0), 

fc=i 

where  A*,  are  functions  (multipliers)  on  TQ .  Then  following  the  same  notation  as  for 
(2.2.12),  the  equations  of  motion  for  constrained  system  are 

j  771. 

—  D2L(q,v )  -  DiL(q,v)  =  ae  +  ^  A h(3k  (2.2.21a) 
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and 

ak  ■  v  +  (3k  ■  v  =  0,  k  =  l,---,m.  (2.2.21b) 

Remark  2.2.16:  In  classical  mechanics,  both  (2.2.18)  and  (2.2.20)  (or  locally,  (2.2.19) 
and  (2.2.21))  are  used  as  dynamical  equations  for  systems  with  constraints.  In  many 
physical  problems,  (2.2.20)  is  more  often  applied  because  it  determines  the  constraint 
reaction  force.  On  the  other  hand  (2.2.18)  is  more  important  for  many  analytical 
problems.  As  we  shell  see  in  a  later  chapter  where  symmetry  will  be  involved,  the 
reduced  dynamics  can  be  derived  from  (2.2.18).  | 

We  next  consider  the  existence  and  uniqueness  of  a  vector  field  on  TQ  which 
satisfies  Lagrange- d’Alembert  principle.  For  an  unconstrained  Lagrangian  system,  it  is 
clear  that  given  exterior  force,  there  exits  a  unique  special  vector  field  satisfying  (2.2.11) 
if  the  Lagrangian  is  regular,  i.e.,  F2L  is  nondegenerate.  Here  F 2L  :  TQ  — >  L(TQ,T*Q) 
is  the  second  order  fiber  derivative  of  the  Lagrangian  L,  known  as  the  Hessian  of 
L.  Locally,  F 2L(q,v)  =  DiD2L(q,  v) .  For  Lagrangian  systems  with  constraint,  the 
following  proposition  gives  the  answer.  We  will  prove  it  in  detail  following  the  idea 
provided  in  [46]. 

Proposition  2.2.17:  If  the  Lagrangian  L  is  regular  and  the  Hessian  F 2L  is  positive 
definite,  then  for  every  admissible  constraint  0  there  exists  a  unique  special  vector  field 
X  compatible  with  the  constraint,  0(X)  =  0,  and  satisfying  the  Lagrange-d’Alembert 
principle  with  constraint. 

Proof:  Without  loss  of  generality,  we  assume  there  is  no  exterior  force,  i.e.,  ae  =  0  (see 
also  Remark  2.2.18(1)  below).  By  the  definition  of  Lagrangian  force  (2.2.10),  we  consider 
the  existence  and  uniqueness  of  a  special  vector  field  X  for  equation 

LIl{X,  •)  —  dHi,  +  u  —  0, 

where  LIl  and  Hi  are  defined  respectively  in  (2.2.8)  and  (2.2.9).  We  prove  the  assertion 
by  using  local  coordinates. 

Since  the  Lagrangian  L  is  regular,  is  nondegenerate  [1].  Then  there  exists  a 
nondegenerate  one-to-one  mapping  II 1  :  w  1(TQ)  — *  X(TQ)  such  that 

LlL(HL(dHL  -  u>),  Y )  =  ( dHL  -  w)(y), 
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for  all  Y  6  35(TQ).  It  is  clear  that  -u)  is  special  vector  field  since  is 

special  (by  Lemma  2.2.4)  and  nL(u>)  is  vertical  (by  Lemma  2.2.3).  To  bring  II i(dHL-uj) 
into  agreement  with  the  constraint,  we  require 

G(EL(dHL-u))  =  0 

or 

&(xL)  =  ©(nLw), 

where  Xl  =  II If  we  can  prove  that  there  exists  a  unique  u  G  r0  satisfying 
the  above  equation,  the  assertion  is  proved. 

Since  u  is  a  constraint  reaction  force,  there  exists  a  1-form  0  G  0  such  that 
u  —  t9  .  Thus  we  have  to  solve  for  6  in 

Q(Xl)  —  0((IIl  o  r)9).  (2.2.22) 

Let  {0k}™=1  be  a  basis  for  0.  Then  (2.2.22)  is  equivalent  to 

(9k,  XL)  =  (0k,(nLo  t)9)  k  =  1,  •  •  • ,  m.  (2.2.23) 

Let  9  =  Afc0fc,  where  are  functions  on  TQ .  Then  (2.2.23)  is  equivalent 

to 

m 

(9k,XL)  =  Y,\i(0k,(TlLOT)ei),  k  =  (2.2.24) 

i—1 

Now  we  only  need  to  show  that  there  exists  a  unique  solution  {Afc}^L1  for  the  above 


equation,  or  equivalently,  m  X  m  matrix  ( ,  (Ilj^  0  T)9'))  is  invertible.  By  the 
construction  of  operator  11^  (cf.  (2.2.14)), 

nL({UL  o  r)(0),  Y)  =  (t9)(Y),  vy  G  X(TQ).  (2.2.25) 

In  local  coordinates,  at  each  point  (q,v)  G  TQ  ,  let  0(q ,  v)  =  (a,/ 3)  and  Y(q ,  v )  =  (Yi,  Y>) 
where  a,/3  G  T*Q  and  y  G  TqQ,i  —  1,2.  We  then  have  r9(q,v)  =  (/3,  0)  and 

(nL  oT)(0)(q,v)  =  (0,w),  (2.2.26) 


for  some  w  G  TqQ  (cf.  Lemma  2.2.3).  By  the  local  expression  of  f II  (2.2.8)',  Equation 
(2.2.26)  becomes 

{D2DoL(q,v))-w  -Yx  =  -0-Yx. 


22 


This  implies  w  should  be,  since  L  is  regular  and  Y\  is  arbitrary, 


w  =  -(D2D2L(q,  v))  1/3. 

Now  (2.2.10)  has  the  local  expression 

(nL  o  r)(a,  /3)  =  (0,  —(D2D2L)~1(3).  (2.2.27) 

Let  9k(q,v )  =  ( ak,/3k ),  where  ak  and  (3k  E  T*Q .  Then  from  (2.2.27)  we  have 

<0*,(IILor)0‘')  =  -(pk,(D2D2L(q,  v))-1?), 

for  i,  k  =  1,  •  •  • ,  to.  Now  replacing  the  components  of  right-hand-side  of  (2.2.24)  by  the 
right-hand-side  of  the  above  equation,  we  get 

m 

(dk,XL)  =  -J2  A ,</?*,  ( D2D2L(q ,  n))-1^),  k  =  1,  •  •  • ,  m.  (2.2.28) 

i=r 

Note  that,  since  0  is  admissible,  the  independence  of  {9k}  implies  the  independence 
of  {fik} .  Also,  that  D2D2L(q,v)  is  positive  definite  implies  that  (D2D2L{q,v))~l  is 
positive  definite.  Then  it  is  easy  to  prove  that  the  m  X  m  Gramian  matrix 

({Pk,  (D2D2L(q,v))~l/3i))  (2.2.29) 

is  invertible.  This  completes  the  proof  of  the  theorem.  | 

Remark  2.2.18: 

(1)  In  [3],  the  invertibility  of  matrix  (2.2.29)  is  considered  as  the  condition  for  the 
existence  and  uniqueness  of  a  solution  for  equation  (2.2.21).  In  classical  mechanics, 
L  =  K  —  V ,  where  K  is  the  quadratic  form  of  kinetic  energy  (or  Riemannian  metric 
on  Q )  and  V  is  potential  energy.  Therefore  the  conditions  of  this  Proposition  are 
naturally  satisfied. 

(2)  If  the  exterior  force  or  control  oje  ^  0,  (2.2.24)  can  be  written  as 

m 

(9k,  n L(dHL  -  «,))  =  £  A i(9k,  (nL  0  r)9i),  k  =  1,  •  •  • ,  m, 

«=i 

and  the  solvability  conditions  is  as  before.  To  keep  the  system  moving  under  the 
constraints,  one  should  make  cLHl,  —  ue  ^  0.  | 
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We  now  study  a  special  type  of  constraint,  which  we  briefly  introduced  earlier 
after  Remark  2.2.8.  Consider  a  foliation  of  TQ  given,  locally,  by  a  set  of  smooth  and 
independent  functions 

{<j>k  :TQ  -*■  »,Jb  =  1, •••,«»}.  (2.2.30) 

We  would  like  to  study  the  motion  of  a  Lagrangian  system  on  a  leaf,  say  zero  level  leaf, 
of  this  foliation, 

<S  =  {(?,  v)  €  TQ\<f>k(q ,  v)  =  0,  k  =  1,  •  • .,  m}.  (2.2.31) 

Then  the  constraint  0  is,  locally, 

Q(q,  v)  =  span{9k  =  {Dxcj>k{q,  v),  D2<f>k(q,  v)),  *  =  1,  •  •  • ,  m}.  (2.2.32) 

Remark  2.2.19:  One  should  note  that  this  constraint,  or  co-distribution,  is  defined 
on  <5,  instead  of  all  of  TQ .  As  we  have  mentioned  in  Remark  2. 2. 8(1),  the  geometric 
objects  (distributions  and  vector  fields)  in  the  above  Definitions,  Lagrange-d’Alembert 
principle  and  the  Proposition  can  be  modified  (restricted)  to  S  with  impugnity.  | 
By  Lagrange-d’Alembert  principle  and  (2.2.32),  the  restricted  dynamic  motion  on 
S ,  satisfies  the  following  equations 

(-7 -D2L(q,  v )  —  DxL(q,  v))  ■  u  =  ae  ■  u  (2.2.33a) 

at 

for  u  satisfying 

D2<fik(q,v)-u  =  0,  k  ■=  1, •  •  -,m  (2.2.33b) 

and  (q,v)  satisfying 

4>k(q,v)=  0,  fc=l,---,m;  (2.2.33c) 

or,  using  multipliers, 

d  m 

—  D2L(q,v )  -  DxL(q,v)  =  ae  +  ^  \kD2<f>k(q,v)  (2.2.34a) 

“  k=  1 

and 

(f>k(q,v)  =  0,  k  =  1,  (2.2.34b) 

A  further  special  case  of  the  above  constraint  is  that  the  4>k ’s  are  linear  functions 
of  Vj  i.e., 

<f>k(9t v)  =  (uk(q)>v) 
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for  some  uk  £  w  1(QI)  for  k  =  1  and  a  submanifold  Q'  of  Q.  With  this 

construction,  the  constraints  can  also  be  considered  as  a  co-distribution  on  Q' ,  denoted 
as 

B  =  {oJk,k  =  1,  •  •  •,  m}. 

In  this  case,  the  equations  of  motion  are  (2.2.33)  or  (2.2.34)  with  D24>k  =  uk,k  = 
1 

Definition  2.2.20:  If  the  co-distribution  B  is  not  integrable,  it  is  called  a  nonholonomic 
constraint  on  Q' ,  otherwise  it  is  a  holonomic  constraint  on  Q' .  | 

Remark  2.2.21:  Equivalently,  one  can  use  the  distribution  BL  to  define  nonholonomic 
and  holonomic  constraints  on  Q' .  | 

Remark  2.2.22:  In  holonomic  problems,  constraints  are  usually  given  by  a  set  of 
functions  on  Q  ,  say 

{fk  :Q  -+»,*;  =  (2.2.35) 

which  gives  a  foliation  of  the  configuration  space  Q .  Then,  the  submanifold  Q'  in 
Definition  2.2.20  is  a  leaf  of  this  foliation.  From  Lagrange-d’Alembert  principle,  the 
motion  of  the  system  satisfies  the  following  equations: 

d  m 

—D2L(q,  v )  -  D\L(q,  v)  =  ae  +  xkDfk{q)  (2.2.36a) 

k=  1 

and 

fk(q)  =  0i  &  =  !,•••,  m.  (2.2.36b) 

From  a  mathematical  point  of  view,  the  holonomic  problem  is  viewed  as  a  simple  case 
since  it  can  be  re-arranged  as  an  unconstrained  problem  (at  least  locally).  But,  as  we 
will  see  in  next  chapter,  physical  systems  (such  as  four-bar  linkages)  with  holonomic 
constraints  may  possess  very  interesting  properties  in  their  kinematics  and  dynamics.  | 
Remark  2.2.23:  It  is  known  that  for  the  systems  with  holonomic  constraints,  the 
Lagrange-d’ ’Alembert’s  principle  is  equivalent  to  the  (Hamilton’s)  principle  of  least  action, 
which  determines  a  curve,  {q(t),t  6  [0, 1]} ,  in  Q  to  satisfying  two  fixed  end  points  and 
constraints,  and  minimizes  the  functional 

/  L(q(t),q(t))dt, 

Jo 
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where  L  is  the  Lagrangian.  It  is  also  known  that,  in  general,  if  the  constraints  are 
nonholonomic,  the  two  principles  are  not  equivalent.  In  particular,  if  the  leaf  constraints 
are  given  by  nonintegrable  functions  (2.2.30),  the  curve  q(-)  €  Q  determined  by  the 
principle  of  least  action  with  constraints  fulfills  the  equation 

i  m  i  m 

jtD2L(q,  v)  -  DxL{q ,  v)  =  £  \k(~D2<f>k(q ,  v)  -  D^iq,  v))  +  £  A kD2<j>\q,  v) 

fc=l  k= 1 

(2.2.37a) 

or,  equivalently, 

d 

-D2£(9,  v)  -  DxC(q,  v)  =  0,  (2.2.37b) 

where  C(q,v)  =  L(q,v)  —  A kcf>k(q,v).  In  [3]  the  systems  determined  by  the 

equations  (2.2.37)  are  said  to  obey  vakonomic  mechanics  (mechanics  of  variational 
axiomatic  kind).  It  has  been  shown  that  for  mechanical  systems  with  nonholonomic 
constraints,  only  the  Lagrange-d’Alembert  principle  provides  physical  motion  (cf.  [39, 
40,48]).  If  Q  is  Riemannian  manifold  with  L  the  metric,  the  above  assertions  say  that  for 
holonomic  mechanics,  the  real  motions  of  the  system  are  geodesics,  but  for  nonholonomic 
mechanics,  it  is  not.  In  Chapter  IV  we  will  study  a  special  nonholonomic  variational 
problem  with  a  meaningful  choice  of  Riemannian  metric.  | 

Example  2.2.24:  Before  ending  this  chapter,  we  consider  a  popular  example  in  the 
literature  of  nonholonomic  dynamics,  that  is,  rolling  ( without  sliding)  a  homogeneous 
sphere  on  an  absolutely  rough  and  horizontal  plane.  We  will  solve  the  constrained 
dynamic  equation  involving  test  vectors. 

Let  m  be  the  mass  of  the  sphere  and  mk 2  the  inertia  of  the  sphere  about  any  axis 
passing  through  the  center,  where  k  is  called  radius  of  gyration.  Let  a  be  the  radius 
of  the  sphere.  Let  0-XYZ  be  a  coordinate  system  in  inertial  space  with  Z-axis  being 
perpendicular  to  the  plane.  Since  it  is  obvious  that  there  is  no  motion  along  Z  direction, 
the  configuration  space  of  the  system  is  5?2  X  50(3).  The  Lagrangian  of  the  system  can 
be  expressed  as 

L  =  ^m(i2  +  y 2)  +  ^mk2(ul  +  w2  +  ul),  (2.2.38) 

where  ( x,y )  gives  the  location  of  the  center  of  the  sphere  on  the  plane  and  (wt,  u>y,uz) 
is  the  angular  velocity  of  the  sphere  expressed  in  coordinate  system  0-XYZ.  The 
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nonholonomic  constraint  of  rolling  without  sliding  can  be  expressed  as 


auix  —  y  and  auy  =  —x.  (2.2.39) 

We  also  assume  that  the  sphere  rotates  on  the  plane  under  the  applied  force  (Fx,  Fy) 
acting  at  the  center  of  the  sphere.  With  the  above  setting,  using  (2.2.33),  we  get  the 
following  dynamic  equations  with  constraints: 

m(xu  i  +  yu  2)  +  mk2(u>xfii  +  d>yn  2  +  w2/i3)  =  Fxu  1  +  Fyu  2  (2.2.40a) 

for  (tii,«2iMi>M2»M3)  being  test  vector  satisfying 

afj,  1  =  u2  and  a/i  2  =  —  U\  (2.2.40b) 


and  (x,y,ux,ojy,ujz)  satisfying  linear  constraints 

aux  =  y  and  auy  —  —x.  (2.2.40c) 


After  a  simple  calculation  to  eliminate  pL\  and  ^2  >  one  can  get 
a2  +  k2  . .. 


m- 


-(iui  +  yu2)  +  mk~uzn  3  =  Fxux  +  Fyu2, 


(2.2.41) 


where  ui,u2  and  /i3  can  be  chosen  arbitrarily.  Then  from  (2.2.41)  and  constraints 
(2.2.40c),  one  gets 


o2  Fx 


a2 

+ 

k2 

m 

a2 

Zk 

a 2 

T 

k 2 

m 

a 

Fy_ 

a2 

T 

k 2 

m 

a 

Fx 

a2  +  k2  m 


(2.2.42) 


uz  =  0. 


If  Fx,  Fy  is  known  the  above  equation  can  be  easily  solved. 

There  are  two  important  features  in  the  above  equations.  First,  the  motion  of  the 
center  of  the  sphere  does  not  depend  on  the  spin  of  the  sphere  about  vertical  direction  of 
the  plane,  i.e.,  the  motion  of  the  center  looks  like  that  of  a  particle  with  modified  mass. 
Secondly,  angular  velocity  uz  is  a  constant.  An  interesting  question  one  may  ask  is  that 
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if  we  consider  conserved  uz  as  an  a  priori  constraint,  by  adding  it  to  the  constraints 
in  (2.2.39)  and  using  the  above  approach,  can  we  get  the  same  dynamic  equation?  The 
answer  is  yes.  In  fact,  equations  (2.2.40)  now  have  the  following  form: 

m(xu  i  +  yu2)  +  mk2(uxpx  +  uyp2  +  3)  =  Fxux  +  Fyu  2  (2.2.43a) 

for  (ui,u2,/ii,/i2,//3)  being  test  vector  satisfying 

ap  i  =  u2  and  ap2  =  ~ ui  and  A*3  =  0  (2.2.43b) 


and  (x,y,ux,Ljy,u!z)  satisfying  constraints 

au>x  =  y  and  au>y  =  —x  and  u>2  =  c,  (2.2.43c) 


which  is  affine.  After  a  straightforward  calculation  from  (2.2.43),  one  gets 

a2  +  k 2  . .. 


m- 


-(xui  +  yu2)  =  Fxux  +  Fyu2, 


(2.2.44) 


where  ux,u2  can  be  chosen  arbitrarily.  Using  (2.2.44)  and  (2.2.43c),  the  final  result  is 
the  same  (2.2.42). 

It  is  very  important  to  observe  that,  after  using  u>z  =  c  as  one  of  the  constraints, 
we  get  a  lower  dimensional  dynamic  equation  (2.2.44)  which  is  independent  of  rotational 
variables.  The  theory  behind  this  phenomenon  and  the  systematic  approach  to  derive 
such  reduced  equations  are  precisely  what  we  will  explore  in  Chapter  V.  | 
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CHAPTER  III 


GEOMETRY  AND  DYNAMICS  OF 
FLOATING  FOUR-BAR  LINKAGES 


Inspired  by  many  mechanical  problems  in  space  engineering,  significant  progress 
has  been  made  to  understand  dynamics  of  coupled  mechanical  systems,  which  consist 
of  serial  links  or  open  chains ,  by  applying  geometric  methods,  symmetry  principles  and 
reduction,  cf.  [21]  and  references  therein.  On  the  other  hand,  problems  in  aerospace 
engineering  also  suggest  that  multibody  systems  with  kinematic  loops  are  of  practical 
importance.  In  this  chapter,  we  study  the  kinematics  and  dynamics  of  the  simplest 
mechanical  system  with  a  kinematic  loop,  which  is  represented  as  (kinematic)  holonomic 
constraint  —  the  planar,  floating  four-bar  linkage.  By  floating  we  mean  that  no  link  is 
fixed  in  space.  It  is  simplest  in  that  it  has  the  fewest  degrees  of  freedom  among  all  kinds 
of  mechanisms  with  closed  loops. 

There  are  at  least  two  important  aspects  of  our  study  of  such  systems.  First,  as 
we  shall  see  in  this  chapter,  the  presence  of  loop  closure  constraints  implies  that  the 
knowledge  of  the  Hamiltonian  structure  and  phase  portraits  for  open  chain  multibody 
systems  cannot  be  applied  directly  to  systems  with  closed  loops.  A  careful  study  of 
a  simple  coupled  system  with  closed  loop  is  necessary.  Secondly,  we  note  that  in  the 
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theory  of  machines,  four-bar  linkage  (with  one  link,  the  frame,  fixed)  are  often  used  to 
synthesize  more  complex  mechanisms  [19].  Moreover,  it  can  generate  the  wide  variety  of 
motions  represented  by  coupler  curves  [18].  For  floating  four-bar  linkages,  one  expects 
similar  properties  to  hold. 

The  outline  of  this  chapter  is  as  follows.  After  stating  the  basic  notations  for 
this  chapter  in  Section  3.1,  we  give  a  geometric  description  of  the  configuration  space 
in  Section  3.2.  In  Section  3.3,  we  give  an  explicit  expression  for  the  kinetic  energy  of 
the  system.  In  Section  3.4,  we  explore  symmetry  properties,  Hamiltonian  structure  and 
reduction  of  four-bar  linkage  dynamics.  In  Section  3.5  a  theorem  of  Smale  is  used  to 
compute  relative  equilibria  for  the  dynamics  of  a  four-bar  linkage.  Then,  using  techniques 
from  singularity  theory,  we  study  the  local  bifurcations  of  relative  equilibria  for  linkages 
which  admit  symmetric  shapes. 


3.1  Notations  and  Geometric  Constraints 

The  structure  of  a  closed  floating  four-bar  linkage  is  represented  in  Figure  3.1.1. 
What  we  mean  by  bar  in  this  chapter  is  a  planar  rigid  body,  on  which  the  center  of  mass 
and  pin  joints  are  located  arbitrarily.  The  bars  are  labeled  sequentially  from  0  to  3.  On 
each  bar,  a  body-fixed  frame  is  attached  such  that  its  origin  is  at  the  center  of  mass  of  the 
bar  and  the  x-axis  is  parallel  to  the  line  connecting  two  joints  on  the  bar.  In  particular, 
we  choose  the  positive  direction  of  of  x-axis  of  i-th  bar  towards  the  (i  +  l)-th  bar,  for 
i  —  0, 1,2,3  (mod  4).  We  define  the  following  quantities. 

djj  the  vector  from  the  center  of  mass  of  i-th  bar  to  the  joint  with  j-th  bar  in 

body-fixed  frame; 

r,  the  position  vector  of  the  center  of  mass  of  i-th  bar  relative  to  an  inertial 

observer; 

r?  the  vector  from  the  system  center  of  mass  to  the  center  of  mass  of  i-th  bar; 

rc  the  position  of  the  system  center  of  mass  relative  to  the  reference  point  of 

the  inertial  observer; 
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observer 

Figure  3.1.1  The  general  structure  of  four-bar  linkage 

Oi  the  orientation  angle  of  i-th  bar  relative  to  the  inertial  frame; 

dij  the  relative  angle  between  i-th  bar  and  j-th  bar,  i.e.,  =  &i  —  9j; 

li  the  length  of  i-th  bar,  which  is  defined  as  the  distance  between  the  joints 

on  i-th  bar,  i.e.,  i,  =  Hd^j+x  -  d* || ; 

m,-,  I{  the  mass  and  the  moment  of  inertia  of  i-th  bar  about  its  center  of  mass; 

m  the  total  mass  of  the  system,  i.e., 

3 

771  =  777 

t=0 

Any  pair  of  adjacent  bodies  is  connected  by  the  following  relation,  the  hinge 
constraint , 

rf+1  =  rf  +  )di+lii  i  =  0, 1,2,3  (mod  4),  (3.1.1) 
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where 


cos(di) 

sin(9i) 


—sin(Oi)  \ 
cos(di )  ) 


the  rotation  matrix.  By  eliminating  rf  in  (3.1.1)  we  find  the  loop  constraint  or  closure 


constraint , 


3 

y;  _  d«,*-x)  = 

4  =  0 


where  the  convention  of  modulo  4  addition  for  the  subscripts  is  adopted. 


(3.1.2) 


3.2  The  Configuration  Space 

In  this  section  we  investigate  the  conditions  under  which  the  loop  constraint  (3.1.2) 
describes  a  submanifold  of  the  configuration  manifold  of  an  open  four-bar  chain. 

For  a  planar  floating  four-bar  open  chain,  the  configuration  space  is 

M  =  ft2  X  Sl  x  S1  x  Sl  x  S\ 

Here  M  is  a  6-dimensional  manifold  with  local  coordinates  m  =  (xo,  2/o,  #o,  #i,  #2,  #3)- 
This  corresponds  to  keeping  track  of  a  material  point  (say  center  of  mass  on  one  of  the 
bodies)  and  the  four  absolute  orientations.  See  [34,43,44]  for  the  Hamiltonian  mechanics 
of  such  open  chains. 

For  a  closed  four-bar  mechanism  considered  in  this  chapter,  the  configuration  space 
denoted  by  Q  is  a  subset  of  M  determined  by  (3.1.2),  i.e., 

Q  =  {m  6  M|F(m)  =  0} 

for 

3 

F(m)  =  £  R(9i)( di)i+1  -  d <,,•_!).  (3.2.1) 

4  =  0 

For  Q  to  be  a  submanifold  of  M ,  we  have  the  following  condition. 

Theorem  3.2.1:  If 

lo  ±  l\  ±  h  ±  ^3  7^  0, 

Q  is  a  submanifold  of  M . 
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Proof:  Note  that  F  :  M  — >  §i2 .  From  [17]  we  know  that  if  0  is  a  regular  value  of  F, 
i.e.,  dF/dm  has  full  rank  for  all  m  £  M  satisfying  F(m)  =  0,  then  Q  is  a  submanifold 
of  M . 

From  (3.2.1)  we  have 

dF  _  f  0  0  —iQsin(Oo)  —l\sin(9i)  — l2sin{92 )  —l3sin(93)\ 

dm  ~  \0  0  Iqcos(9q)  hcos(9i )  l2cos{92)  hcos(93)  )  ' 

It  is  easy  to  check  that  all  the  nontrivial  determinants  of  2  X  2  submatrices  are  given  by 
the  following  functions, 

fli(m)  =  l0lisin(9i  -  90 ) 

52(m)  =  lQlosin(92  -  90 ) 

53( m)  =  lQl3sin(93  -  90)  (3.2.2) 

<74(m)  =  lil2sin{92  -  #i) 

175(1x1)  =  l\l3sin{93  -  9X) 
g6(m)  =  Ul3sin(93  -  92). 

Therefore,  if  for  each  m  €  M  satisfying  F(m)  =  0,  there  exists  an  i  such  that 
9i( m)  ±  0,  Q  is  a  submanifold  of  M .  It  is  obvious  that  the  above  condition  depends  on 
the  relative  angles  and,  consequently,  the  lengths  of  the  bars.  To  arrive  at  the  condition 
in  the  statement  of  the  theorem,  we  proceed  from  the  converse. 

If  <7t(m)  =  0  for  all  i,  from  (3.2.2),  we  have 


9l  —  9q  =  0  or  ir 


(3.2.3a) 


and 


93  —  92  =  0  or  7r 


(3.2.3b) 


and 


<?3  -  90  =  0  or  it. 


(3.2.3c) 
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Premultiplying  (3.1.2)  by  R(-80),  we  get  following  equivalent  closure  constraint 
equations 

lo~\~hcos(@l  ~  <?0 )  +  /2(co.s(03  —  02)co.s(03  ~  0o ) 

+  sin(9 3  -  02)sm(03  -  0o))  +  /3cos(03  —  02)  =  0  (3.2.4a) 

l\sin{6\— 80)  +  /2(cos(0 3  -  02)stn(03  —  #0) 

-  sin(03  -  02)cos(93  -  &q))  +  l3sin(d3  -  02)  =  0.  (3.2.4b) 

So,  to  make  0,-’s  satisfy  both  (3.2.3)  and  (3.2.4),  the  lengths  of  the  bars  should  satisfy 

/o  +  (-l)fcl/i  +  (-l)fc2/2  +  (-l)fcsI3  =  0  (3.2.5) 

for  some  k\,  fc2,  Ar3  £  {0, 1} .  By  contradiction,  if,  for  all  k\,  fc2,  A;3  £  {0, 1} ,  (3.2.5)  does 
not  hold,  then  there  is  an  i  such  that  <7j(m)  ^  0,  which  means  that  Q  is  a  submanifold 
of  M .  | 

Table  3.2.1  summarizes  the  cases  which  violate  the  conditions  on  lengths  of  the 
bars  in  Theorem  3.2.1.  It  is  easy  to  observe  that  case  (i)  can  never  happen  since  /,  are 
assumed  to  be  positive.  In  addition,  cases  (iii),  (iv),  (v)  and  (vi)  are  trivial  since,  in 
these  cases,  none  of  the  relative  angles  can  vary,  i.e.,  the  configuration  space  loses  one 
degree  of  freedom  and  the  linkage  becomes  a  rigid  structure. 

In  the  following  we  show  that  the  condition  in  Theorem  3.2.1  can  be  simplified  by 
ignoring  the  labels  on  the  bars  and  give  a  topological  description  of  Q .  We  first  recall 
some  definitions  and  results  in  the  classical  theory  of  mechanisms  [15,36,37]. 

For  a  four-bar  linkage  in  classical  mechanics,  in  which  one  bar,  the  frame ,  is  fixed, 
the  following  quantities  are  defined: 

5  =  length  of  shortest  bar, 
l  =  length  of  longest  bar, 
p,  q  =  lengths  of  intermediate  bars. 

A  bar  which  is  free  to  rotate  through  27 r  with  respect  to  a  second  bar  is  said  to  revolve 
relative  to  the  second  bar  and  is  referred  to  as  a  crank.  Any  bar  which  does  not  revolve 
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©1~  0Q  83-  83  83-  0Q 

link  condition 

0  0  0 

.1  Q+l  +  =  0 

0 

0 

l0+ll'l2'1  3=0 

0 

0 

lQ+ll'l2+l2=0 

0  7t  71 

l  q4*  l  ^  4"  ^  ^  0 

0 

0 

Iq-I  j+z  2+ 1^=0 

7t  0  7t 

lQ'll'l2l  f° 

K  K  0 

V  ^  r  ^  3=^ 

71  K  TC 

Iq-I 

Table  3.2.1  Singular  Configurations 

is  called  a  rocker.  If  it  is  possible  for  all  bars  to  become  simultaneously  aligned,  such  a 
state  is  called  a  change  point  and  the  linkage  is  said  to  be  a  change-point  mechanism. 
Given  four  bars,  assembling  them  into  a  closed  loop  and  labeling  them  sequentially,  the 
linkage  may  have  instantaneously  one  of  three  kinds  of  shape.  These  correspond  to 

f  >  0; 

sin(63  -  d2)  <  <  0; 

l  =  0. 

In  the  classical  theory  of  mechanisms,  they  are  called  lagging  form,  leading  form  and 
dead  point ,  respectively.  The  following  theorem  is  due  to  Grashof  [15]. 

Theorem  3.2.2: 

(1)  A  four-bar  mechanism  has  at  least  one  crank  if 

Sri  <  p  4-  q 


35 


and  all  three  will  rock  if 


s  +  l  >  p  +  q. 

(2)  A  fonr-bar  mechanism  is  a  change-point  mechanism  if  and  only  if 

s  +  l  =  p  +  q.  | 

Remarks  3.2.3: 

(1)  If  s  +  l  <  p  +  q ,  the  shortest  bar  is  the  revolving  bar. 

(2)  It  is  easy  to  check  that  the  cases  (ii),  (vii)  and  (viii)  in  Table  3.2.1  correspond  to 

s  +  1  =  p  +  q,  i.e.,  they  correspond  to  change-point  mechanisms.  | 

Since  the  conditions  in  Grashof’s  Theorem  3.2.2  are  so  important,  we  give  the 

following  definition. 

Definition  3.2.4:  We  refer  to  the  condition  s  +  J  <  p  +  q  and  s  +  l  >  p  +  q  as  Grashof 
and  non- Grashof  condition ,  respectively.  The  corresponding  linkages  are  referred  to  as 
Grashof  and  non- Grashof  mechanism  or  linkage,  respectively.  | 

From  Theorem  3.2.1,  we  immediately  have  following  result. 

Corollary  3.2.5:  If  a  four-bar  linkage  is  constructible,  i.e.,  I  <  s  +  q  +  p  and 

s  +  l^p+q, 

Q  is  a  submanifold  of  M .  In  other  words,  Q  is  a  submanifold  of  M  if  either  Grashof 
or  non- Grashof  condition  holds.  | 

To  give  a  topological  description  of  the  configuration  manifold  Q  ,  we  first  need  the 
following  result. 

Proposition  3.2.6:  Let  l\  =  s.  Then  s  +  1  <  p  +  q  if  and  only  if  sin(6 3  —  02)  ^  0  for 
all  configurations. 

Proof:  The  mechanism  can  be  assembled  with  s  adjacent  to  /  or  with  s  opposite  l . 
And,  l  can  be  l0,  U  or  1$.  If  #3  —  62  =  kir ,  the  whole  structure  attains  a  triangular 
shape  which  has  the  property  that  the  sum  of  two  sides  is  larger  than  the  third  one. 
Then  it  is  easy  to  check  that  all  possible  cases  will  lead  to  s  -(-  l  >  p  +  q.  Further,  it  is 
obvious  that  if  s  +  /  =  p+  q,  there  exist  and  #3  such  that  sin(9s  —  02)  =  0.  And,  if 
s  +  l  >  p  +  q,  by  Grashof’s  theorem,  all  bars  will  rock  with  respect  to  each  other.  This 
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means  that  dead  point  is  reachable.  Therefore,  if  sin(9 3  -  92 )  7^  0  for  all  configurations, 

s  +  l  <p  +  q.  | 

An  equivalent  way  to  state  the  above  assertion  is  that  a  four-bar  linkage  is  a  Grashof 
mechanism  if  and  only  if  it  is  constrained  to  be  in  either  leading  form  or  lagging  form. 
Moreover,  for  non-Grashof  mechanisms,  the  linkage  can  vary  continuously  from  leading 
form  to  lagging  form. 

From  Grashof’s  theorem  and  the  above  proposition  we  can  get  a  topological 
description  for  Q . 

Theorem  3.2.7: 

(a)  For  a  Grashof  linkage,  i.e.,  s  +  I  <  p  +  q,  Q  has  two  components.  Each 
component  is  diffeomorphic  to  K2  x  51  x  S1 . 

(b)  For  a  non-Grashof  linkage,  i.e.,  s  +  l  >  p  +  q,  Q  =  5R2  X  S1  X  S1 . 

Proof :  Our  proof  is  based  on  explicit  parameterization  of  the  manifold  Q .  Recall  that 
from  (3.2.1)  the  dimension  of  Q  is  four.  Again,  we  let  s  —  l\ . 

(a)  If  s  +  l  <  p  +  q,  we  consider  the  parameters  (z0, 2/0 ■>  #o,  #i) ,  where  ( x0,yo )  is  the 
coordinate  of  any  point  on  0-th  bar  in  inertial  frame.  From  the  definitions  of  9q  and  9\ 
and  the  Grashof  Theorem  3.2.2,  both  9q  and  9\  can  vary  from  —  n  to  7r  independently. 
From  Proposition  3.2.8,  one  component  of  the  manifold  Q  corresponds  to  leading  form. 
The  other  corresponds  to  lagging  form. 

(b)  If  s  +  /  >  p  +  q,  from  Grashof  theorem,  there  exists  an  angle  a,  0  <  a  <  7r, 
such  that,  9io  G  [— a  +  7r,a  +  7r].  At  the  boundaries,  the  system  is  at  “dead  point”.  Now, 
we  consider  the  independent  parameters  (xo,yo,9o,9) ,  where 

if  sin{032)  >  0  (lagging  form); 
if  sin(932)  <  0  (leading  form). 

It  is  easy  to  check  that  9  and  5jo  have  a  one-to-one  relation  in  both  leading  form  and 
lagging  form.  Moreover,  9  varies  from  —ir  to  n.  For  9  G  (— tt,0),  the  mechanism  is 
in  the  leading  form;  for  9  e  [0,7r],  the  mechanism  is  in  lagging  form.  So  (xo,  yo,9o,&) 
parameterize  3?2  X  51  X  51 .  ■ 
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Remarks  3.2.8: 


(1)  Gibson  and  Newstead  in  [13]  proved  above  assertion  using  the  methods  of  algebraic 
geometry. 

(2)  This  theorem  says  that  for  a  fixed  location  of  the  center  of  mass  in  5J2 ,  the 
configuration  space  is  the  disjoint  union  of  two  tori  for  a  Grashof  linkage  and 
a  torus  for  a  non-Grashof  linkage.  For  the  latter,  the  torus  can  be  split  into  two 
parts,  one  corresponding  to  leading  form,  the  other  corresponding  to  lagging  form. 
It  can  be  imagined  that,  when  I  +  /  =  q  +  p,  two  tori  touch  each  other  with  a 
circular  intersection.  This  can  be  seen  in  the  following  example. 

(3)  From  the  above  remark,  we  note  that,  for  both  Grashof  and  non-Grashof  linkages, 

one  may  use  (xq,  ya,  90,  9x)  and  specified  form  (leading  or  lagging)  to  parameterize 
Q  locally.  For  the  non-Grashof  case,  one  has  to  worry  about  the  parameterization 
in  the  neighborhood  of  the  dead  point.  This  problem  can  be  solved  by  re-labeling 
the  bars.  We  will  discuss  it  further  in  the  next  section.  1 


Figure  3.2.1  Reduced  configuration  spaces,  an  example 
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Example  3.2.9:  Here  we  illustrate  the  Theorems  3.2.1  and  3.2.7  by  a  simple  example. 
First,  we  fix  one  bar  (position  and  orientation),  say  lo  and  let  9q  =  0.  This  means  a 
point  in  5J2  X  51  has  been  chosen  and  the  dimension  of  the  configuration  is  reduced  to 
one.  Now  the  constraint  equation  (3.1.2)  becomes 

l0  +  hcos(9i)  4-  l2cos(02)  +  hcos(d3 )  =  0 

lisin(di)  +  l2sin(92)  +  l^sin^o)  —  0. 

Eliminating  one  more  angle,  say  d3 ,  we  get 

f(9\,92)  =  (lo  +  l\cos(9\)  +  l2cos(62 ))2  +  ( lisin(91 )  +  l2sin(02 ))2  -  1%  =  0.  (3.2.7) 

The  solutions  of  this  equation  gives  a  curve  on  a  torus  T2  as  the  configuration  space. 

Now  we  choose  l0  =  3,  /i  =  3,  Z3  =  4  and  let  l2  vary.  Figure  3.2.1  shows  the 
results.  The  rectangle  with  opposite  edges  identified  is  the  standard  way  to  represent  a 
torus  [17].  We  see  that  when  l2  —  1.9,  s.+  l  <  p  +  q  holds  and  the  solution  of  (3.2.7)  on 
T2  is  a  disconnected  closed  curve;  when  l2  =  2,  $  +  l  =  p  +  q  holds  and  the  solution  of 
(3.2.7)  on  T2  is  a  “figure  8”;  when  l2  =  3,  s  +  l  >  p+  q  holds  and  the  solution  of  (3.2.7) 
on  T2  is  a  connected  closed  curve;  when  l2  =  4,  s  +  1  =  p  +  q  holds  and  the  solution  of 
(3.2.7)  on  T2  is  a  “figure  8”,  again;  when  l2  =  4.5,  s  +  1  >  p  +  q  holds  and  the  solution 
of  (3.2.7)  on  T2  is  a  single  closed  curve,  again.  If  the  linkage  is  allowed  to  float,  we  get 
the  configuration  spaces  described  in  Theorem  3.2.7.  ■ 


3.3  Kinetic  Energy 

In  this  section  we  derive  the  kinetic  energy,  or  Lagrangian  since  we  assumed  that 
no  potential  energy  is  involved,  for  the  whole  system.  The  basic  idea  is  to  write  the 
kinetic  energy  for  each  individual  body  first  and  then  use  the  constraint  equations  to 
eliminate  extra  variables. 

The  kinetic  energy  of  the  i-th  bar  is 

Ti  = 
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where  u>i  —  9i.  The  total  kinetic  energy  is 

T  =  \  INI2- 


2 

i=0 


2  f 

!=0 


(3.3.1) 


To  describe  the  kinetic  energy  relative  to  the  center  of  mass,  we  have  following  useful 
equations, 


rt-  =  rc  +  r- , 
3 

=  °- 


i  =  0,1, 2, 3 


(3.3.2) 

(3.3.3) 


«= o 


(3.3.4) 


By  applying  (3.3.2)  and  (3.3.3),  (3.3.1)  becomes, 

3  3 

T  =  i^mi||r8c||2  +  ^m||rc||2. 

!=0  i=0 

Applying  (3.1.1)  and  (3.3.2),  we  get 

r-  =  — [i2(6»!_1)mi_idi_1,1- 
m 

-  £(0i)(mi_1di)i_1  +  (mi+i  +  mt+2)dijJ+1) 

+  R{9i+i){{mi+i  +  ^i+2)d,+i,,-  —  mj+2dl+i,,+2) 

+  ^(^i+2)TOi'+2di+2.i+l], 

for  i  =  0, 1,2,3  (mod  4).  Note  that  the  convention  on  the  subscript  can  be  used  because 
of  the  closed-loop  condition.  Furthermore, 

r?  =  —  [mj-iWi-i  fl(0i_i)d, 

m 

-  w;itl(0;)(mi_1dj,i_i  -j-  (mi+i  +  mJ+2)dtii+i) 

+  Wi+ii?(0i+i)((mi+i  +  mi+2)d,+iii  —  mj+2d,+i>j+2) 

+  &i+2  f^(^i+2  )^i-(-2  d  j-|-2,J+l]  > 


(3.3.5) 


for  i  =  0, 1, 2, 3  (mod  4),  where 


u>i  = 


0  —  u>i 

0 


LJi 


By  substituting  the  formula  for  rf  into  (3.3.4),  we  get  a  more  compact  expression  for 
the  kinetic  energy 
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(3.3.6) 


T  =  \  <  w,Mu>  >  +^m||rc||2, 

where  Q  =  (o>o, wi,w2,W3)t  and  M  =  ( =  0,1, 2, 3)  is  a  4x4  symmetric  matrix, 
with  elements  given  as  follows. 

Let 


m{  =  —  4-  mi+1) 

TYl 


+  mjmi+2(mi+i  +  mi+2)] 

(3.3.7a) 

-  II  mi  1  2  \ 

mi  =  — 0  (mi+2  “  mi+lmi-l 

TO'4 

(3.3.7b) 

-  in  mimi+2  ,  x 

mt-  =  (mi+i  +  mM  . 

(3.3.7c) 

Then 

Mu  =  Ii  +  mf||d||2t.+1  +  T»f_1||d||Ji4_1 

-  2fhjL1  <  dj.i+i, >  (3.3.8a) 

M =  ~fhi  <  di,i_f.i,  i2(^i-M,*)dt4.i,i  > 

+  Thf1  <  d,it-+i,  fZ(0,+iij)di+iii+2  > 


+ 

< 

d; 

,i-l»  ^(^t'+l,i)dj+l,i  > 

+ 

ml11 

< 

d, 

R(0i+l,i)di+l,i+2 

> 

(3.3.8b) 

=  - 

-ml1 

< 

d„ 

t+1,  ^(^i+2,f)di+2,j+l 

> 

- 

fhIH 
171  i+ 1 

< 

dj 

,t+l,  -R(^t+2,t')dt+2,t-l 

> 

- 

fhH 
mi+ 2 

< 

d  i 

,i—l ,  -&(0i+2,i')dt+2,i-l 

> 

— 

mf" 

< 

d, 

,i-l ,  f?(^i+2,i)di+2,i+l 

> 

(3.3.8c) 

for  i  =  0,1, 2, 3  (mod  4). 

Remark  3.3.1:  Let  the  four  bars  be  labeled  sequentially  such  that  —  s.  It  is  clear 
that  the  dependence  of  T  on  i  =  0,1, 2, 3  can  be  rearranged  so  that  kinetic  energy 
T  depends  on  #io  and  #32-  In  addition,  from  (3.1.2)  or  (3.2.4),  i  =  0,1, 2, 3  depend 
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on  each  other  by  constraint.  From  [36],  we  have  the  following  explicit  dependence  of  62 
and  63  on  0q  and  0\  : 

82  =  &o  +  f(8io;  sgn(sin(032))) 

83  =  0o  +  £f(0io;  sgn(sin{0 32))), 


where 


for 


and 


f(a  (■  to  \\\  /  7T  -  p+  if  sin(d 32)  >  0; 

/(«,„;  ,gn(s,n(S„)))  =  <  0 

0±  =arctan2(—hsin(0io),lo  +  /icos(#io)) 

±  arctan2 ((4/3/2  ~  Oo  +  ^1  ~  ^3  ~  ^2  +  2/o/icos(0io))^, 
2/3  ( /q  +  l\  +  2/o/lCOs(0io))5) 

//I  /  •  //)  \ \ \  J  — 7r  +  7+  if  sm(032)  >  0; 

*,„(«„(*„)))  =  (  _x  +  7-  if  sj„J„32;  <  j. 


for 


7  *  =  arctari2(l3sin((3± )  +  /isin(0io)>  /o  —  hcos{^)  +  /icos((?io)). 


In  above  expression  arctari2  is  defined  as  follows.  Let 


x  =  cos(O)  and  y  =  sin(0 ) 


9\  =  tan  1  — 
a; 


( principle  value). 


Then, 


f  #i ,  if  x  >  0; 

0  =  arctan2(y,  x)  —  <  8\  +  sgn(y)TT,  if  x  <  0; 

(  sgn(y)j,  if  x=0. 

From  the  above  expressions  we  see  that  the  value  of  632  depends  not  only  on  01O , 
but  also  on  sgn(sin(03 2)),  i.e.,  leading  or  lagging  form.  For  a  Grashof  linkage,  on  each 
component  of  the  configuration  space,  the  sign  of  sin(032 )  does  not  change.  For  a  non- 
Grashof  linkage,  we  note  that  for  almost  all  values  of  #32  there  exists  a  neighborhood 
about  #32  such  that  #32  depends  on  0io  uniquely,  except  at  the  dead  points,  where 
sin(032)  =  0.  As  we  have  mentioned  in  Remark  3.2. 8(3),  to  deal  with  this  problem  in 
such  neighborhoods,  an  easy  way  is  to  re-label  four  bars  such  that  szn(032)  7^  0.  One 
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can  always  do  so  since  all  four  bars  cannot  be  aligned  in  this  case.  Based  on  the  above 
discussion  we  conclude  that,  in  general,  at  each  point  in  Q ,  every  element  of  the  matrix 
M  can  be  expressed  as  a  function  of  .  | 

Differentiating  the  loop  constraints  (3.1.2),  one  can  get  a  relation  between  (u>o,o>1) 


and  (w2,w3): 

(:)=*(:)’ 


(3.3.9) 


where 


(_  losin(03—8o) 

/2sin(03  —  02) 
lpsin(  9-2— 6  o ) 
l3sin(83  —  02) 

Again,  following  the  same  line  of  reasoning  as  in  Remark  3.3.1,  the  matrix  N  is  well- 
defined  locally  in  general. 

We  summarize  the  above  discussion  in  the  following  theorem. 

Theorem  3.3.2:  The  kinetic  energy  of  a  floating  four-bar  linkage  can  be  represented 
as 

r=  i  <U,MU>  +^rn||rc||2,  (3.3.11) 


_  h  stn(fl3  — fl! ) 
l2sin(83  —  02) 
h  sin(02  —  9i ) 

l3sin(93  —  92) 


(3.3.10) 


where  u  =  (wo,wi)T  and 

M  =  (/  NT)M^^j  (3.3.12) 

for  matrix  M  given  in  (3.3.8)  and  matrix  N  given  in  (3.3.10).  The  elements  of  matrix 
M  are,  locally,  functions  of  $io  •  ■ 

Remark  3.3.3:  As  in  the  case  of  planar  multibody  systems  with  open  chains,  the 
kinetic  energy  of  a  four-bar  linkage  depends  on  the  relative  angles  of  the  bars,  although 
such  dependence  appears  more  complicated.  By  capturing  this  property,  we  are  able  to 
study  symmetry  and  reduction,  as  will  be  seen  in  next  section.  g 

Before  ending  this  section,  we  give  a  property  of  the  matrix  N  which  will  be  used 
in  next  section. 

Proposition  3.3.4:  If  N  is  well  defined, 
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Proof :  Premultiplying  (3.1.2)  by  R(—9 2)  and  R(— #3),  we  get 


and 


losin(60  —62)  +  hsin(6i  —62)  +  l3sin(93  -  02)  =  0 


l0sin(90  -  03 )  +  l\sin(9\  —  93)  -f-  l2sin(92  -  93)  —  0 


respectively.  The  result  follows  immediately. 


I 


3.4  Symmetry,  Integral  and  Reduction 

We  shall  show  here  that  a  floating  four-bar  linkage  is  a  simple  mechanical  system 
with  symmetry  in  the  sense  of  Smale,  which  has  been  defined  in  Section  2.1.  For 
simplification,  we  make  two  assumptions.  First,  we  restrict  attention  to  the  type  of 
Grashof  linkages,  i.e.,  Grashof  condition  s  +  l  <  p  +  q  holds,  and  assume  that  the  linkage 
has  been  in  either  leading  or  lagging  form.  The  four  bars  are  labeled  sequentially  with 
li  =  s.  For  the  non- Grashof  case  we  will  give  a  discussion  in  the  final  remark  of  this 
section.  Secondly,  we  assume  that  the  inertial  observer  is  placed  at  the  center  of  mass  of 
the  system.  We  are  able  to  do  so  since  the  kinetic  energy  in  (3.2.11)  is  invariant  under 
the  translation  in  inertial  space.  In  [44]  this  process  is  explained  via  symplectic  reduction 
by  the  translation  group  3?2 . 

Keeping  the  above  assumptions  in  mind,  we  can  identify  each  ingredient  in  the  4- 
tuple  ( Q,K,V,G )  of  simple  mechanical  system  with  symmetry.  The  configuration  space 
now  is  simply 

Q  =  S1  x  51. 

Each  point  q  in  Q  is  parametrized  by  q  —  (9o,9i),  the  absolute  angles  of  0-tli  and  1-st 
bar.  The  Riemannian  metric  on  Q  is,  for  each  q  €  Q , 

K{q){vq,wq)  =<  vq,lA(q)wq  >, 

where  vq,wq  £  TqQ  ~  3?2  and  matrix  M  is  given  in  Theorem  3.3.2.  The  potential 
energy  V  has  been  assumed  to  be  zero.  Let  G  =  51  ~  50(2)  be  the  Lie  symmetry 
group.  Its  action  $  on  Q  is  defined  by,  for  any  <j>  6  G , 

$(<^7  (#0;  #i))  =  (0o  +  4>,  9\  +  <f>).  (3.4.1) 
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As  have  been  discussed  in  Remark  3.3.1  and  3.3.3,  matrix  M  depends  on  9\ 0  only.  This 
implies  that  G  acts  on  Q  by  isometries.  Therefore,  we  conclude  that  a  floating  four-bar 
linkage  is  a  simple  mechanical  system  with  symmetry.  In  addition,  for  this  system,  the 
associated  Lagrangian  is  given  by 

L{q,v q)  =  i  K(q)(vq,vq )  =  i  <  i>„M(0lo)t>g  >,  (3.4.2) 

where  vq  =  (#o,  0\)T ,  and  the  Hamiltonian  is  given  by 

H(q,p)=\  <p,M-\0io)p>,  (3.4.3) 

where  p  =  (po5Pi)T  =  K^(q)(vq)  —  M(0io)wg  E  T*Q  ~  3ft2 .  It  is  clear  that  H  is 
G  -invariant. 

Remark  3.4.1:  From  the  above  discussion,  we  see  that  the  dynamic  structure  of  a 
floating  four-bar  linkage  is  the  same  as  that  of  a  coupled,  planar  two-body  [43]  although 
the  explicit  forms  of  the  Riemannian  metrics  are  very  different.  Therefore,  in  some 
essential  ways,  the  description  of  these  two  types  of  systems  are  parallel  to  each  other. 
Because  of  this,  we  will  not  prove  the  following  assertions  about  reduction  in  detail.  For 
reference,  see  [43].  | 

Let  £  =  £,  a  constant  in  Q  =  0?  (the  Lie  algebra  of  S'1).  One  can  show  that  the 

infinitesimal  generator  of  the  action  $  given  in  (3.4.1)  corresponding  to  £  is  simply 

6j(?)  =  1(1,  if-  (3-4.4) 

Then,  the  momentum  map  J  :  T*Q  — *  Q  =  so* (2)  for  the  cotangent  lift  action  of  $  (cf. 
Section  2.1)  determined  by  (2.1.26)  is 

(J(q,p),0  =<  P,{q(q)  >=  ttPo  +Pi)  =  v-  (3.4.5) 

From  Noether’s  Theorem,  it  is  easy  to  show  that  v  or  p  =  J(<?,p)  =  po  +p\  is  conserved 
along  trajectories  of  Xh  for  the  Hamiltonian  H  in  (3.4.3)  and  that  p  is  simply  the  net 
angular  momentum  of  the  floating  four  bar  linkage  relative  to  an  observer  at  the  system 
center  of  mass. 

Since  the  dynamical  trajectories  are  confined  to  a  level  set  of  the  form  J_1(/x)  and 
the  group  51 ,  viewed  as  the  isotropy  subgroup  of  the  momentum  value  p,  acts  freely 
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on  J-1(p),  one  gets  the  symplectically  reduced  dynamics  on  the  reduced  phase 

space  P n  =  J~1(p)/S1  ~  51  x  K1 ,  where  is  the  reduced  Hamiltonian.  For  details 
of  symplectic  reduction,  see  [1]. 

As  in  [44]  it  is  also  possible  to  Poisson-reduce  the  dynamics.  Following  the  standard 
Poisson  reduction  framework  which  has  been  shown  in  Section  2.1,  we  have  the  following 
results. 

Since  the  manifold  M  =  T*(SX  X  S 1)  parameterized  by  2  =  (00, 0i,Po,Pi)  is 
symplectic,  a  Poisson  structure  {  ,  }m  can  be  constructed,  as  we  indicated  in  Remark 
2.1.1.  It  is  easy  to  show  that  this  structure  is  of  the  form 


n  t  -ll.  -ll.  d/2 \ 

{fuf2}M(z)  -  2J  de.  •  dp.  dp.  ■  de. ) 


(3.4.6) 


i= 0 


for  all  fi,  f2  G  The  action  of  G  =  S1  on  Q  given  by  (3.4.7)  is  free  and  proper. 

The  quotient  P  =  T*(Sl  X  Sl)/Sl  ~  S1  X  K2  carries  a  reduced  Poisson  structure. 
Parameterizing  P  by  z  —  (#io,Po,Pi) ,  the  noncanonical  Poisson  bracket  on  P  is  given 
by, 


r  f  7  x  (z\_  d/i  df2  df2  df2  ,dfi  dfi 

l,  2  p(  )  (^pi  dpa  d6w  dpi  dp0 

=  V/1r-AV/2, 

0  -1  r 

where  A,/j  s  C°°(P),  V/,  =  (SA  §£  §£-)r  and  A  =  I  1  0  0 


(3.4.7) 


The  reduced 


-10  0 


Hamiltonian  H  is  given  by 


H(0w,po,pi)  =  H(0o,9i,po,pi), 


(3.4.8) 


since  the  matrix  M  in  (3.3.12)  is  a  matrix  of  functions  of  0io  .  The  reduced  dynamics  is 
then  immediately  given  by  reduced  Hamiltonian  vector  field 


Xfj  =  A  VH 


or,  explicitly,  by  differential  equations 


OH 


Po  = 


d0io 

dH 


P 1  =  - 
#io  = 


dB  io 

dH  dH 

dpi  dpQ ' 


(3.4.9) 
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Unlike  coupled  two-body  problem,  Equation  (3.4.9)  involves  complicated  analytic  expres¬ 
sions  resulting  from  the  substitutions  for  #3  and  82  in  terms  of  8q  and  8\  (cf.  Remark 
3.3.1).  Certain  qualitative  aspects  of  the  reduced  dynamics  can  still  be  explored  such  as 
relative  equilibria  which  we  will  investigate  in  the  foDowing  section. 

Remark  3.4.2:  For  non-Grashof  mechanisms,  following  Remark  3.3.1,  the  system 
still  has  symmetry  group  G  =  S1 .  It  is  clear  that,  in  this  case,  if  80  and  Q\  are  just 
parameters  of  configuration  space  S1  xS1 ,  instead  of  real  angles  of  0-th  and  1-st  bars,  all 
the  results  in  this  section  are  valid.  However,  for  convenience,  in  next  section  we  will  still 
use  physical  angles  0O  and  8\  for  non-Grashof  mechanisms.  Then,  80  and  8 1  will  play 
the  role  of  local  coordinates  for  51  X  51  and,  consequently,  (3.4.5),  (3.4.7)  and  (3.4.9) 
will  be  the  expressions  of  momentum,  reduced  Poisson  bracket  and  reduced  dynamics  in 
local  coordinates,  respectively.  I 

One  of  the  advantages  of  applying  reduction  theory  to  mechanical  systems  is  that 
it  helps  to  make  the  dynamics  of  the  system  more  transparent.  For  our  system  which  is 
of  four  dimensions  in  phase  space,  the  reduction  process  make  it  possible  to  display  the 
dynamics  by  phase  portraits  on  a  lower  dimensional  space.  To  illustrate  this,  we  show 
two  examples  here. 

Example  3.4.3: 

Consider  a  floating  four-bar  linkage  whose  parameters  satisfy  the  Grashof  condition 
and  which  is  of  lagging  form.  In  particular,  the  parameters  are  chosen  as  follows. 


do3 


mo  =  1,  mi  =  1,  m2  =  1,  m3  =  1; 

I0  =  1,  h  =  1,  h  =  1,  h  =  1; 

(-1.5,1),  doi  =  (1.5,1),  dio  =  (—0.5, 1.3),  dn  =  (0.5, 1.3), 
d21  =  (-1.5,1),  d23  =  (1.5,1),  d32  =  (—2,  A),  d30  =  (2,  A). 


Following  the  same  procedure  as  in  [44],  the  dynamics  can  be  further  reduced  to  a 
symplectic  leaf.  The  Hamiltonian  on  a  leaf  is  a  function  of  $io  and  p  =  ~  Po)  and 

the  dynamics  on  the  leaf  is  given  by 
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M,o  dH,„  ,  ip  an  , 

IT  =  Tt  =  ~a^('w) 

Therefore,  for  fixed  value  of  angular  momentum,  one  can  draw  the  phase  portrait  on  a 
cylinder  with  coordinates  01O  and  p.  Figure  3.4.1  shows  the  phase  portraits  for  A  =  0 
and  A  =  —  5,  respectively.  On  Figure  3.4.1(a),  we  see  there  is  one  center  and  one  saddle 
point.  However,  in  Figure  3.4.1(b),  there  are  two  centers  and  two  saddles.  The  difference 
is  caused  by  an  offset  in  the  position  of  center  of  mass  of  3rd  bar  from  the  line  connecting 
the  two  joints  on  that  bar.  I 


Example  3.4.4: 

Consider  a  floating  four-bar  linkage  whose  parameters  satisfy  the  non-Grashof 
condition  and  the  parameters  are  chosen  as  follows. 


m0  =  1,  mj  =  1,  m2  =  1,  m3  =  1; 

/o  =  1,  I\  =  1,  h  =  1,  h  =  1; 


d03  =  (-2,A),  d0i  =  (2,  A),  d10  =  (-1.5, -1),  d12  =  (1.5, -1), 

d2i  =  (  —  1.5,  —1.4),  d23  =  (1.5, -1.4),  d32  =  (-1.5, -1),  d30  =  (1.5, -1). 

Instead  of  displaying  the  dynamics  on  position-momentum  phase  space,  here  we  try 
to  show  it  on  position-velocity  phase  space.  Since  the  dynamics  cannot  be  further 
reduced  so  that  it  depends  on  one  relative  angle  and  corresponding  angular  velocity, 
one  has  to  display  the  phase  portrait  in  three  dimensional  space.  For  a  given  angular 
momentum,  the  phase  portrait  sits  on  a  surface  in  this  space.  In  this  example,  this  space 
is  parameterized  by  (0io,#,#o3)j  where  9  is  defined  in  (3.2.6).  Figure  3.4.2  shows  the 
phase  portraits  corresponding  to  A  =  —15  and  A  =  0,  respectively.  From  Figure  3.4.2, 
we  see,  again,  the  change  of  numbers  of  centers  and  saddles.  | 

In  the  next  section,  we  will  show  how  to  compute  the  centers  and  saddle  points 
and  associated  bifurcations. 
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(b)  A  =  0 

Figure  3.4.2  Phase  portraits  for  a  non- Grashof  linkage 
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3.5  Relative  Equilibria  and  Bifurcations 

We  first  give  the  definition  of  relative  equilibria  and  recall  Sraale’s  theorem  [42]. 
Consider  a  Hamiltonian  system  ( M,Q,Xh ),  where  M  is  a  2n- dimensional  man¬ 
ifold,  12  is  a  symplectic  two-form  on  M  and,  for  some  smooth  Hamiltonian  H  on  M , 
Xh  is  Hamiltonian  vector  field  determined  by  (2.1.9).  Let  G  be  a  Lie  group  acting 
on  M  symplectically  and  leaving  H  invariant.  The  following  definition  of  a  relative 
equilibrium  is  standard  [1]. 

Definition  3.5.1:  Let  Fx  be  the  flow  of  Xh  on  M .  Then  ze  E  M  is  a  relative 
equilibrium  if  FX[t(ze)  is  a  stationary  motion,  i.e.,  there  exists  f  €  Q  such  that 

FX„(ze )  ~  e*p(*0(2e), 

where  Q  is  the  Lie  algebra  of  the  group  G .  I 

Remarks  3.5.2: 

(1)  Let  X fj  be  the  Poisson  reduced  vector  field  as  shown  in  (2.1.21).  Then  ze  is  a 
relative  equilibrium  if  and  only  if 


*#(*(*«))  =  0, 

where  7r  :  M  — *  M/G  is  the  canonical  projection. 

(2)  A  physical  interpretation  of  relative  equilibria  is  that  if  the  dynamics  of  a  system 
is  rotationally  invariant,  the  dynamical  orbit  of  a  relative  equilibrium  appears  to 
be  a  fixed  point  for  an  observer  in  a  suitable  uniformly  rotating  coordinate  system. 

I 

Given  a  simple  mechanical  system  with  symmetry,  (Q,  K,  V,  G) ,  the  associated 
Hamiltonian  system  can  be  constructed  by  {T*Qi$Iq,Xh) ,  where  i2o  is  the  canonical 
two-form  and  the  Hamiltonian  H  is  given  by  (2.1.25).  The  symmetry  group  G  acts  on 
T*Q  by  lifting  (cf.  (2.1.3)).  The  following  theorem  is  due  to  Smale  [42]. 

Theorem  3.5.3:  For  a  simple  mechanical  system  with  symmetry  {Q,  K,  V,G),  define 
augmented  potential  function  by 

V*  :  Q  -  »  ;  g  V(q)  -  i/r(g)(6,(g),&,(g))  (3.5.1) 
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for  each  £  €  Q,  where  £g  is  infinitesimal  generator  of  the  action  corresponding  to  £. 
Then  ze  =  ( qe,pe )  £  T*Q  is  a  relative  equilibrium  if  and  only  if  qe  is  a  critical  point  of 
Vg  for  some  £  G  £  and  pe  =  A"b(g)(£Q(ge)). 

Remarks  3.5.4: 

(1)  It  can  be  shown  that,  for  a  given  £  G  Q ,  has  the  symmetry, 

U(«»W)  =  UM  (3-5-2) 

for  all  g  G  :=  {g  G  G\Adg£  =  £} .  If  the  action  $  is  free  and  proper,  then  the 
quotient  space,  Q/G^,  is  a  smooth  manifold  and  7r^  :  Q  — ►  Q/G ^  is  a  submersion. 
Thus  induces  a  function  V j  on  Q  /  G ^  such  that 

V*  =  V*  O  7T|. 

(2)  If  G  is  Abelian,  G^  =  G .  In  this  case,  we  refer  to  Q/G j  as  the  shape  space,  and 
the  points  7T $(?e)  as  the  relative  equilibrium  shapes.  It  can  be  shown  that,  in  this 
case,  if  7 r^(?e)  is  a  local  minimizer  of  V $ ,  the  corresponding  ze  =  ( qe,Pe )  is  a  stable 
relative  equilibrium,  and  if  7r^  ( qe )  is  a  local  maximizer  of  Vj ,  the  corresponding 
ze  =  (qe ,  pe )  is  an  unstable  relative  equilibrium. 

(3)  One  can  also  use  the  amended  potential  in  the  sense  of  Smale  to  study  the  relative 

equilibria  (see  [41]).  I 

Now  we  address  the  problem  of  computing  relative  equilibria  of  floating  four-bar 

linkages.  In  Section  3.4  we  have  shown  that  this  mechanism  is  a  simple  mechanical 
system  with  symmetry.  So  Smale’s  Theorem  3.5.3  is  applicable  to  the  system.  Recall 
that,  by  letting  £  =  £,  a  constant  in  Q  =  K,  the  infinitesimal  generator  of  action  (3.4.1) 
with  respect  to  this  £  is  £q(?)  =  £(1,  l)r.  Hence,  by  Theorem  3.5.3,  ( qe,pe )  is  a  relative 
equilibrium  point  on  T*Q  if  and  only  if  qe  is  a  critical  point  of  the  function 

Wo, 0i)  =  -?(!,  1)mQ), 

where  qe  =  ((0o)e,  (#i)e) ,  Pe  =  ((Po)e,  (pi)e)  and  matrix  M  is  defined  in  (3.3.12). 
Applying  Proposition  3.3.4  and  the  4x4  matrix  M  defined  in  (3.3.8),  we  have 

Wo,0i)  =  -£2eTMe,  (3.5.3) 
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where  e  =  (1  1  1  1)T.  Since  the  elements  of  matrix  M  are,  locally,  functions  of  6\q 
or  constants,  the  above  satisfies  (3.5.2)  for  all  g  £  51 .  It  follows  that  the  induced 
function  shown  in  Remark  3.5. 4(1)  is 

Wo)  =  Ve(0o,*i).  (3.5.4) 

Then,  the  critical  points  qe  of  V £  will  make  (0io)e  =  (#i)e  —  ($o)e  to  be  the  critical 
points  of  .  At  relative  equilibrium,  the  relative  angles  between  bars  keep  unchanged 
and  the  whole  system  rotates  around  the  system  center  of  mass  with  constant  angular 
velocity. 

Remarks  3.5.5: 

(1)  Following  Remark  3.3.1,  it  should  be  noted  that,  in  general,  given  the  value  of 
(<?io)e  one  cannot  tell  what  the  relative  equilibrium  shape  of  the  linkage  looks  like. 
The  particular  form  (leading  or  lagging)  has  to  be  indicated  at  the  same  time. 

(2)  given  in  3.5.4  is  the  locked  inertia  if  £  =  1 .  Its  value  at  8io  equals  the  value 

of  the  moment  of  inertia  of  the  corresponding  frozen  system,  i.e.,  the  system  with 
all  joints  locked,  about  its  center  of  mass.  The  above  result  shows  that  a  relative 
equilibrium  shape  corresponds  to  a  frozen  system  which  has  maximum  or  minimum 
value  of  moment  of  inertia  within  all  possible  frozen  systems.  Since  <£  is  a  constant, 
without  loss  of  generality,  we  let  £  =  1  later.  | 

In  the  rest  of  this  section,  we  are  particularly  interested  in  assemblies  which  admit 

configurations  with  reflection  symmetry,  which  will  be  called  symmetric  configuration. 
Applying  the  notations  in  Section  3.1,  a  floating  four-bar  linkage  is  of  symmetric  type  if, 
with  proper  consecutive  labeling  of  the  bars, 

=  m3,  l\  =  /3  (3.5.5a) 

|doi|  =  |do3|,  |dio|  =  |d3o|,  jd^l  =  |d32 1 ,  |d2i |  =  |d33|  (3.5.5b) 

and 

di°  (q  ^  d30  >  0.  (3.5.5c) 

In  other  words  it  can  form  symmetric  shapes  as  shown  in  Figure  3.5.1.  It  is  not 
hard  to  verify  that  any  four-bar  linkage  which  satisfies  (3-5.5)  has  two  such  symmetric 
configurations. 
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Figure  3.5.1  A  symmetric  configuration 


Although  it  is  not  easy  to  find  the  critical  points  of  Vj  analytically,  for  a  particular 
example  one  can  easily  find  them  numerically.  Unlike  the  planar  two  body  problem 
[43]  for  which  the  dimension  of  the  shape  space  equals  1,  here  the  function  has 
many  parameters.  A  natural  question  is  to  determine  how  these  parameters  affect  the 
relative  equilibria,  e.g.  their  numbers  and  location  on  shape  space,  etc..  Of  course,  it  is 
difficult  to  answer  this  question  for  completely  arbitrary  choice  of  parameters.  However, 
by  leaving  one  or  two  particular  parameters  free  such  that  the  assembly  preserves  its 
symmetric  configurations  and  freezing  all  other  parameters,  one  still  can  observe  a 
nontrivial  bifurcation  phenomenon.  To  illustrate  this  we  consider  an  example. 
Example  3.5.6:  Let  us  choose  the  parameters  as  follows. 

77lo  =  1,  771 1  =  1,  7712  =  1?  771 3  =  U 

do3  =  (— 2,  Ao),  doi  =  (2,  Ao),  dm  —  (—1.5,  —1),  di2  =  (1.5,  —1), 
d2i  =  (—1.5,— 1-4),  d23  =  (1.5, -1.4),  d32  =  (-1.5, -1),  d30  =  (1.5, -1). 


Now  the  assembly  has  non-Grashof  structure. 


Using  Vf' ,  for  any  Ao  one  can  find  relative  equilibria  (#io)e  for  both  leading  form 
and  lagging  form,  and  hence,  corresponding  9  which  is  defined  in  (3.2.6).  As  A0  varies 
from  —oo  to  +co ,  one  can  plot  a  diagram  for  8  versus  Ao .  Figure  3.5.2  shows  the  result, 
in  which  solid  dots  represent  stable  relative  equilibria,  small  circles  represent  unstable 
relative  equilibria.  I 


-10  0  10  20 
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Figure  3.5.2  Bifurcation  diagram:  an  example 

From  this  example  one  can  make  the  following  empirical  observations: 

(1)  There  are  two  unbounded  symmetric  branches  on  the  diagram  and  and  these 
branches  are  bifurcated  at  some  points.  The  bifurcations  appear  to  be  pitchfork 
bifurcations. 

(2)  Almost  any  value  of  8  can  be  a  relative  equilibrium  for  a  particular  A0 .  In  other 
words  the  bifurcation  diagram  is  connected  globally. 

(3)  The  number  of  relative  equilibria-  can  be  two,  six  and  ten. 

The  first  observation,  which  relates  to  the  local  bifurcation  problem,  is  what  we 
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will  concentrate  on  in  the  rest  of  this  section.  The  others  will  be  discussed  later. 

As  we  have  seen,  the  function  of  a  four-bar  linkage  is  a  multiple  parameter 
function.  One  might  expect  very  complicated  bifurcation  features  with  respect  to  these 
parameters.  Here,  instead  of  considering  a  general  structure,  we  study  a  special  assembly 
which  admits  a  symmetric  configuration.  To  avoid  too  many  tedious  calculations  we 
particularly  choose  the  parameters  of  the  assembly  as  follows. 


mo  =  m\  = 

m2  =  m3  =  1; 

(3.5.6a) 

do3  =  (— do,  Ao), 

doi  =  (do,  Ao), 

dio  =  (—1,0), 

di2  =  (1,0), 

(3.5.6b) 

^21  =  (— d2,  A2), 

^23  =  (^2>A2), 

d32  =  (  — 1,0), 

d30  =  (1,0), 

(3.5.6c) 

where  do  and  d2  are  fixed  and  do  >  d2  >  0,  Ao,A2  6  3i.  Moreover,  we  consider  the 
non-Grashof  case  only,  i.e.,  s  +  1  >  p  +  q. 

Figure  3.5.3  shows  two  symmetric  configurations  for  the  above  choice  of  parameters. 
We  will  see  that  although  only  two  parameters  A0  and  A2  are  left  to  be  free,  the 
bifurcation  features  with  respect  to  these  parameters  are  still  informative. 

The  function  now  has  the  following  form: 

V, c  =  i(2A2«n(032)  +  d2cos(632)  -  cos(&3 1)  +  docos(03 0) 

+2A2.sm(02i)  +  d2cos(62i)  -  d0d2cos(d20 )  +  d0cos(6l1o)) 

+  ^-(2-sin(03O)  +  \2cos(020)  -  2sin(6w))  +  C,  (3.5.7) 

where  C  is  a  constant  determined  by  dtJ,  ro;  and  the  moments  of  inertia  of  the  bodies, 
For  i,j  =  0,  1,  2,  3,  9{j  =  6{  —  9j  satisfy  the  constraint  equations 

d0  +  cos(0io)  +  d2cos(920 )  +  cos(930)  =  0  (3.5.8a) 

sin(6io)  +  d2sin(82o)  +  sin(d30 )  =  0.  (3.5.8b) 

In  the  following,  at  symmetric  configuration,  the  variables  will  be  denoted  by 
superscript  “s”  (say,  #f0),  the  formulas  will  be  denoted  by  “|s”  (say,  /(#io)|s)-  As 
shown  in  the  example,  the  bifurcation  diagram  of  relative  equilibria  will  be  parameterized 

by  (Mo). 
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Figure  3.5.3  Symmetric  configurations 

Theorem  3.5.7:  For  a  floating  four-bar  linkage  with  parameters  shown  in  (3.5.6),  the 
bifurcation  diagram  of  relative  equilibria  has  the  following  properties: 

(1)  There  are  two  infinite  branches  in  the  diagram,  one  corresponds  to  in  the 
leading  form,  another  one  corresponds  to  in  the  lagging  form.  We  refer  to  these  as 
the  symmetric  branches. 

(2)  There  exists  a  constant  A£  such  that  no  bifurcation  occurs  on  the  symmetric 
branch  of  leading  form  if  At  =  A?;  and,  no  bifurcation  occurs  on  the  symmetric  branch 
of  lagging  form  if  At  =  -At. 

(3)  On  the  symmetric  branch  of  leading  (lagging)  form,  if  At  <  A£  (— Aj),  there 
exists  a  constant  c\  (03)  such  that  the  relative  equilibria  are  stable  for  A0  <  c\  (03), 
unstable  for  Ao  >  ci  (03),  bifurcated  for  Ao  =  c\  (03);  one  the  other  hand,  if 
A2  >  At  (— AJ),  there  exists  a.  constant  («.*)  such  that  the  relative  equilibria  are 
unstable  for  A0  <  c2  (04),  stable  for  Aq  >  c2  (c4),  bifurcated  for  Aq  =  c\  (c4). 


0 1 


(4)  Assume  A2  ±A2 .  Let 


±  ±ei  A2  +  e2A2±t3 
e  —  sgn- 


^4  A2if5 


and 


S±  =  sgn(S1\2±S2), 


(3.5.9) 


(3.5.10) 


where  e,  and  Si  are  constants  which  are  determined  by  do  and  d2 .  Then,  the  bifurcation 
on  the  symmetric  branch  of  leading  form  will  be  supercritical  pitchfork  if  e+S+  <  0.  It 
will  be  subcritical  pitchfork  if  e+S+  >  0.  Similarly,  the  bifurcation  on  the  symmetric 
branch  of  lagging  form  will  be  supercritical  pitchfork  if  e~S~  <  0.  It  will  be  subcritical 
pitchfork  if  e~S~  >0. 

Remarks  3.5.8: 

(1)  Based  on  the  techniques  of  bifurcation  theory  shown  in  Appendix  3.A,  the  proof 
of  the  above  assertions  is  elementary.  However,  two  aspects  have  to  be  considered 
before  the  bifurcation  theory  is  applied.  First,  one  has  to  determine  the  existence 
of  a  bifurcation  and  where  the  branches  are  bifurcated.  Secondly,  applying  the 
techniques  of  bifurcation  theory  shown  in  Appendix  3.A  will  involve  as  high  as 
fourth  order  derivatives  of  the  function  Vj  with  respect  to  0U) .  However,  as  we  have 
shown  in  Remark  3.3.1,  matrix  M  depends  on  6w  in  a  very  complicated  way  which 
makes  explicit  representing  of  high  order  derivative  of  to  6 10  almost  impossible. 
Therefore,  when  applying  the  techniques  of  bifurcation  theory  to  this  particular 
system,  one  should  consider  the  closure  constraint  equation  (3.5.8)  simultaneously. 

(2)  Although  the  proof  of  the  above  assertions  is  elementary,  it  requires  a  large  effort  in 

calculations.  We  used  MACSYMA  to  handle  these  computations.  In  the  following, 
we  only  give  a  sketch  of  the  proof.  g 

Proof  of  Theorem  3.5.7 :  Note  that  the  function  V £  can  be  written  as  a  function  of 
relative  angles  0io,  02 o,  and  #30  ,  which  are  related  through  constraint  equations  (3.5.8). 
From  (3.5.8),  we  can  consider  (locally)  02O  and  03o  as  the  functions  of  01O.  Again,  from 
(3.5.8)  one  can  generate  the  quantities  ^22-| a  and  C^22-| s  for  any  positive  integer  i. 
Moreover,  from  Figure  3.5.3  it  is  easy  to  see  that  at  symmetric  configuration 

02o  =  *  and  0*o  =  -0|o-  (3.5.11) 
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With  above  considerations,  one  can  have  closed  form  expressions  for 
For  instance, 


d'e, 

de\, 


and  d' 03g,  | 
9S‘0  I 


and 


f  9020.1  -  2 

1  80io  >*  ~  d2 

]  9030  I  _  1 

l  90 10  1*  -  1 

_  2cos2{0)n) 

»«?o 

s  d'2sin(810 ) 

_  2cos(8’„) 

Is  d2sin(${0 ) 

(3.5.12) 


(3.5.13) 


and  so  on. 

To  prove  assertion  (1)  in  the  statement  of  the  theorem,  one  needs  to  show  that  at 
symmetric  configuration,  the  equation 


^1  =0 
de  ls 


10 


does  not  depend  on  Ao-  Concentrating  on  the  term  involving  Ao  in  V$  and  applying 
(3.5.11)  and  (3.5.12),  one  can  show  that  the  first  derivative  of  that  term  with  respect  to 
Q\o  at  symmetric  configuration  is  zero.  Since  the  rest  of  the  terms  of  —  0  are  still 

functions  of  0fo  ,  one  can  see  two  infinite  symmetric  branches  in  the  bifurcation  diagram 
for  two  different  9f0  .  Assertion  (1)  is  thus  proved. 

Applying  (3.5.12)  and  (3.5.13),  one  can  show  that  the  second  derivative  of  the 
function  Vj  at  symmetric  configuration  has  the  form 
0% 


del. 


d2n,  ,  f  d2  -  2cos3(0fo)  ,  x  cos2(0fo), 


de\. 


d\ 


(3.5.14) 


disin(0 j0) 

where  II  is  the  summation  of  the  terms  not  involving  Aq  in  .  It  is  obvious  that  when 


A,  =  A*  = 


a  2do cos2  (#f0)  —  d\ 
cos2(9’0)sin(9sw )  ’ 


(3.5.15) 


will  not  depend  on  Ao.  One  can  also  show  that  with  (3.5.15),  0  under 

OVl0 

assumptions  (3.5.6)  and  (3.5.7).  This  means  that  bifurcation  may  not  occur  on  either 
symmetric  branch  of  leading  form  or  symmetric  branch  of  lagging  form.  Note  that  on 
these  different  forms  cos(9f0)  has  the  same  value,  sin(9f0 )  has  the  same  absolute  value 
but  different  sign.  (See  Figure  3.5.3)  Thus,  assertion  (2)  is  proved. 

As  we  have  known  earlier,  the  stability  of  relative  equilibria  depends  on  the  sign 
of  airrHs'  From  (3.5.14)  we  see  that  L  is  a  linear  function  of  Ao .  Using  the  A£  in 

C^io  ^10 

(3.5.15),  the  proof  of  (3)  is  straight  forward. 
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To  prove  assertion  (4),  we  apply  the  bifurcation  theory  mentioned  in  Appendix  3.A. 
Let  Aq  denote  Ci  in  assertion  (3)  for  some  suitable  i.  One  can  show  that  at  (0io,Ag), 


d%  d2Vs  d3Vz  d2Vt 

O  f\  A  rtl  A  /l1}  """  A  n  A  \  • 

3010 


30io 


30fo  30m  3A 


Moreover 


and 


34h^  ,ns  X*N  _  dlCi  A2  +  €2A2±63 

304o(0i°^o)-  e4A2±e5 


(3.5.16) 


(3.5.17) 


(3.5.18) 


where  “  +  ”  corresponds  to  leading  form,  “  —  ”  corresponds  to  lagging  form  and 

€1  =  — 12dg(d2  —  dg)^(d2  —  dgd^  —  dgd2  —  2c?2  +  dg  —  4dg); 


e2  =  48dgd2((d2  -  do)2  -  l)(dl  -  d2Q)y/A  ~  (d0  -  d2)2; 

€3  =  48do^(^2  -  d0)2(dl  -  d0dl  -  dgd2  -  4d2  +  d$  -  2d0); 
e4  =  dj((d2  —  d0)2  -  4)(d2  —  ^o)2\/4  —  (do  -  d2)2; 
e5  =  2d2((d2  —  do)2  —  4)((d2  —  do)3  —  4d2); 


01  =  - 


^2  =  ~ 


(d2  ~  00 )2  , 


4d| 


(d2  4*  3dgd2  —  3dgd2  4*  4d2  +  dg) 
2d2  -\/4  —  (dg  —  d2)2 


Since  (3.5.17)  and  (3.5.18)  are  not  zero  in  general,  applying  the  Lemma  3.A.3  in  Appendix 
3. A,  we  can  say  is  strongly  equivalent  to  the  normal  form  of  pitchfork  bifurcation. 

In  addition,  the  type  of  pitchfork  bifurcation  depends  on  the  sign  of  (3.5.17)  and  (3.5.18). 
The  assertion  (4)  is  proved.  | 

Remarks  3.5.9: 

(1)  The  condition  of  A2  ^  A2  guarantees  that  (3.5.18)  and  the  denominator  of  (3.5.17) 
are  not  zero. 

(2)  In  general  the  bifurcation  changes  from  a  supercritical  one  to  subcritical  one  at  the 

roots  of  numerator  of  (3.5.17).  | 

Example  3.5.10:  To  see  how  A2  changes  the  bifurcation  diagram  with  parameter  Ao 
we  give  following  example.  We  will  concentrate  on  the  symmetric  branch  with  respect 
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to  leading  form.  Let  do  =  2  and  d2  =  1.  Then  e  and  S  have  the  following  form 


e 


+ 


+  6.938A2  -  3.623 
1.873-  0.051A2 


6+  =  sgn(1.025  —  0.028A2) 


Then 


+  if  -7.426  <  A2  <  0.488; 

—  (  +1,  otherwise. 

Note  that  the  region  for  A2  is  an  approximation.  So  we  can  say  that,  when  A2  6 
(—7.426,  0.488),  the  pitchfork  bifurcation. is  supercritical.  Otherwise,  it  is  subcritical. 
Figure  3.5.4  shows  this  result.  I 


Before  closing  this  section,  we  would  like  to  make  a  few  additional  remarks. 

Remarks  3.5.11: 

(1)  Although  our  discussion  only  concentrated  on  a  structure  of  non-Grashof  type,  a 
version  of  Theorem  3.5.7  also  holds  for  the  Grashof  case. 

(2)  Up  to  now  we  have  understood  the  phenomenon  of  bifurcation  on  the  symmetric 
branches.  The  global  analysis  of  the  bifurcations  involves  massive  symbolic 
computations.  However,  as  shown  in  the  Example  3.5.6  in  this  section  and  other 
simulations,  one  can  numerically  determine  a  global  bifurcation  diagram.  A  large 
body  of  such  numerical  simulations  show  that  the  branches  in  the  bifurcation 
diagram  are  connected.  In  other  words,  for  any  point  in  shape  space,  there  is  a 
finite  A0  for  which  that  shape  determines  a  relative  equilibrium.  This  is  consistent 
with  the  linearity  of  in  A0 .  This  property  provides  a  possibility  to  control  the 
attitude  of  a  space  structure  with  a  closed  kinematic  chain  by  simply  changing  the 
position  of  the  centers  of  mass  of  some  bars. 

(3)  Our  results  in  this  paper  rely  on  some  ideal  conditions,  for  instance,  the  symmetry 

condition  (3.5.5),  and  the  absence  of  external  and  internal  disturbances.  One 
may  ask  what  will  happen  when  these  conditions  are  violated.  The  answer  to 
this  question  may  relate  to  the  notion  of  universal  unfolding  in  bifurcation  theory. 
Numerical  results  show  that  it  is  possible  to  use  the  unfolding  property  to  control 
the  shape  of  the  structure  near  the  bifurcation  point.  ■ 
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Appendix  3. A 

In  this  appendix,  we  review  some  basic  concepts  and  results  of  bifurcation  theory. 
The  standard  reference  is  [14]. 

Consider  a  single  scalar  equation 

ff(x,A)  =  0.  (3.A.1) 

The  bifurcation  theory  studies  how  the  solutions  x  of  this  equation  change  with  the 
parameter  A;  or,  more  precisely,  what  type  of  bifurcation  occurs  with  parameter 
A.  Without  loss  of  generality,  one  can  assume  <?(0,0)  =  0.  Moreover,  we  assume 
g  :  5ft  x  3J  — *  is  smooth.  This  is  one  of  the  standard  local  (static)  bifurcation  problems 
with  one  state  variable,  called  the  recognition  problem ,  As  with  many  bifurcation 
problems,  this  problem  can  be  solved  successfully  through  singularity  theory,  in  which 
the  related  issue  is  called  finite  determinacy.  Let  z  =  (x,A).  Near  origin,  the  function 
g  can  be  written  as 

d(z)  =  £  +  E  a«(z)A  (3.A.2) 

|a|<fc+l  |a|=fc+l 

for  some  smooth  functions  aa  defined  in  a  neighborhood  of  the  origin.  Here  we  used  the 
conventions  with  multi-indices: 


|aj  =  ai  +  a2, 


a!  =  (ax)!(a2)!, 


A  key  question  is  what  terms  in  (3.A.2)  can  be  ignored  such  that  the  values  of  coefficients 
of  remaining  terms  can  be  used  to  determine  the  qualitative  behavior  of  the  original 
equation  (3.A.1),  for  example,  the  variation  in  number  of  solutions.  Singularity  theory 
solves  this  problem  by  finding  a  suitable  change  of  coordinates  such  that  function  g  is 
equivalent  to  a  standard  model  h ,  called  normal  form.  A  precise  definition  is  given  as 
follows  (see  [14]). 

Definition  3.A.1:  Two  smooth  mappings  g,  h  :  SR  X  3?  -*  defined  near  the  origin  are 
equivalent  if  there  exist  a  local  diffeomorphism  of  3?2 ,  (x,A)  t-*  (X(x,  A),  A(A))  at  the 
origin  and  a  nonzero  function  S(x,  A),  such  that 

g(x,A)  =  S(x,  A)h(X(x,  A),  A(A)),  (3. A. 3) 
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where  Xx(0,0)  >  0  and  A'(0)  >0.  If  A  =  A,  g  and  h  are  strongly  equivalent.  g 

From  this  definition  we  see  that,  since  5(a;,  A)  is  nonzero,  the  solution  of  g(x,  A)  =  0 
and  h(X,  A)  =  0  are  the  same  in  the  sense  of  diffeomorphism.  From  this  point  of  view,  by- 
means  of  singularity  theory  one  can  show  why  and  what  the  high-order  terms  in  (3.A.2) 
do  not  effect  the  qualitative  behavior  of  equation  g(x,  A)  =  0.  It  should  be  noticed  that 
although  this  method  does  not  tell  us  how  to  derive  an  appropriate  normal  form  h ,  for 
most  physical  problems,  such  as  the  one  considered  in  this  paper,  it  is  not  hard  to  pick 
up  some  of  the  candidates  from  a  large  number  of  known  simple  polynomials  of  x  and 
A ,  or  the  model  of  normal  forms  which  have  standard  bifurcation  diagrams.  This  is  in 
essence  the  spirit  of  application  of  singularity  theory  to  a  physical  problem. 

Without  considering  detailed  issues  of  singularity  theory  which  are  applicable  to 
bifurcation  problems,  we  directly  give  the  following  result  which  will  be  used  in  the  next 
section.  For  details  see  [14],  chapter  2.  First  we  need  the  concept  of  germs. 

Definition  3. A. 2:  Two  smooth  functions  defined  near  the  origin  are  equivalent  as  germs 
if  there  is  some  neighborhood  of  the  origin  on  which  they  coincide.  Let  £Xi\  denote  the 
set  of  equivalence  classes  of  such  functions.  The  elements  in  £x<\  are  called  germs.  g 
Lemma  3. A. 3:  A  germ  g  G  £x,\  is  strongly  equivalent  to 

exk  +  SXx  (3.A.4) 


for  k  >  2  if  and  only  if  at  x  =  A  =  0 

and 

d  d  d 

<  =  sgni-fg,  S  =  sgn--g.  (Z.A.Sb) 

■ 

Remark  3. A. 4:  When  k  =  3  the  normal  form  (3.A.4)  provides  a  pitchfork  bifurcation. 
From  this  lemma,  so  is  g  if  (3. A. 5)  holds.  It  is  easy  to  show  that  if  e8  >  0,  the  pitchfork 
bifurcation  is  subcritical;  if  e8  <  0,  the  pitchfork  bifurcation  is  supercritical.  | 
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CHAPTER  IV 


OPTIMAL  CONTROL  PROBLEM  ON 
A  RIGID  BODY  WITH  TWO  OSCILLATORS 


An  interesting  problem  in  multibody  mechanics  is  the  problem  of  nonholonomic 
motion  planning,  or  kinematic  control  problem.  In  recent  research  on  various  multibody 
mechanical  systems  with  symmetry,  the  theory  of  principal  bundles  with  connections 
has  led  to  clear  insight  into  the  geometric  structure  of  the  problem,  and  provided  a 
common  framework  for  the  formulation  of  related  optimal  control  problem.  However, 
explicit  or  partially  explicit  solution  to  the  necessary  conditions,  given  by  differential 
equations  on  phase  space,  for  the  optimal  path  and  control  is  still  a  challenge.  Although, 
under  certain  conditions,  the  symmetries  of  the  systems  imply  the  existence  of  conserved 
quantities  for  the  differential  equations  given  by  the  necessary  conditions,  working  with 
local  coordinates  at  an  early  stage  of  the  analysis  usually  causes  difficulties  in  uncovering 
such  quantities.  In  this  chapter,  we  consider  a  particular  mechanical  system  consisting  of 
a  rigid  body  and  two  point-masses,  for  which  the  structure  group  of  the  principal  bundle 
is  non-Abelian.  For  such  mechanical  systems,  formulating  the  necessary  condition  for 
the  optimal  control  in  an  intrinsic  way  is  no  longer  trivial.  We  derive  this  condition  by  a 


65 


perturbation  method  and  a  Hamiltonian  formulation  and,  based  on  structure  symmetry 
and  localization,  explore  the  explicit  solvability  of  optimal  control  for  this  system  from 
a  Hamiltonian  point  of  view.  The  procedures  of  symplectic  and  Poisson  reduction  are 
applied  systematically  for  this  purpose. 

4.1  Preliminaries 

In  this  section,  we  give  some  mathematical  background  for  this  and  next  chapters. 
We  first  recall  some  useful  definitions  and  results  about  principal  fiber  bundles  and  the 
theory  of  connections  in  differential  geometry.  The  main  references  for  this  part  are 
[6,33].  Then,  we  show  an  important  principal  connection  for  the  simple  mechanical 
system  with  symmetry,  i.e.,  mechanical  connection,  which  is  originally  due  to  Smale  and 
Rummer  (cf.  [24]). 

4.1.1  Principal  Fiber  Bundle  and  Connections 

Definition  4.1.1:  Let  B  and  Q  be  smooth  manifolds,  referred  to  as  the  base  space 
and  the  total  space,  respectively.  Let  G  be  a  Lie  group,  referred  to  as  structure  group. 
A  four-tuple  p  =  ( Q,B,v,G )  is  called  a  principal  fiber  bundle  or  principal  G -bundle  if 
the  following  conditions  are  satisfied: 

(a)  G  acts  on  Q  to  the  left,  freely  and  differentiably,  where  the  action  is  denoted  by 

$:GxQ^Q 

A  A  <4'U) 

(9,l)  ►»  4(9.9)  =  *»(9)  =  9 ' 9; 

(b)  B  is  the  quotient  space  of  Q  by  the  equivalence  relation  induced  by  G,  i.e., 
B  =  Q/G ,  and  the  canonical  projection  7r  :  Q  — *■  B  is  differentiable; 

(c)  Q  is  locally  trivial,  that  is,  for  each  x  G  B  there  is  a  neighborhood  U  of  x  such 
that  7r — 1  ( f/)  is  isomorphic  with  U  xG  in  the  sense  that  q  G  x-1(17)  (x (q),  4>(q)) 
is  a  diffeomorphism,  where  (j) :  x-1(C/)  — >  G  satisfies  4>(g  ■  q)  =  g4>{q),  V(/  G  G.  | 

Remark  4.1.2:  In  the  traditional  definition  of  principal  fiber  bundle,  the  structure 
group  G  acts  on  Q  to  the  right.  In  general,  the  choice  of  left  or  right  actions  is  related  to 
the  properties  of  the  physical  system  under  study.  Therefore  in  this  and  next  chapters, 
we  will  also  show  the  corresponding  formulations  for  right  action  whenever  it  is  necessary. 

■ 
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For  x  G  B ,  the  fiber  over  2  is  a  closed  submanifold  of  Q  which  is  differentiably 
isomorphic  with  G .  For  any  point  q  G  Q  ,  the  fiber  through  q  is  the  fiber  over  2  =  n(q) . 
The  trivial  or  product  principal  fiber  bundle  is  a  special  principal  fiber  bundle,  in  which 
Q  =  B  x  G .  In  this  case,  G  acts  on  Q  by  $(5,  (2,  h ))  =  (2,  gh )  for  2  G  B  and  g,h  G  G . 

Let  V  be  an  r  -dimensional  vector  space  with  basis  «i ,  •  •  ■ ,  vr .  If  oq ,  •  •  • ,  ar  belong 
to  zsk(Q),  the  space  of  real- valued  A: -forms,  then  ^[=1  arvr  is  called  a  V -valued  k- 
form.  The  space  of  V -valued  A; -forms  on  Q  is  denoted  by  vs k(Q]  V ) .  Let  u  G  zsk(Q\  V ) . 
As  a  real- valued  A; -form,  its  exterior  derivative  satisfies  the  following  important  equation 
(Cartan’s  formula): 


k 

du(X0,  ■■■,Xk)=  £(-l)%(w(X0,  Xk)) 

*= 0 

+  ^2(-l)i+^([Xi,Xj],Xo,---,Xi,---,Xj,---,Xk), 
«<i 


(4.1.2) 


where  Xt-  G  3>(Q),i  =  0,  •  -  • ,  A:  and  X{  denotes  that  Xi  is  deleted.  In  particular,  if 
u  6  wx(Q]V), 

(du)(X,  Y )  =  X(u(Y))  -  T(w(X))  -  w([  X,  Y  ] ),  (4.1.3) 


where  X,  Y  G  X(Q).  Note  that,  here  we  used  the  convention  of  exterior  derivative  in 
[1],  instead  of  the  one  in  [33]. 

We  now  ready  to  define  connections  on  a  principal  fiber  bundle. 

Definition  4.1.3:  A  connection  on  a  principal  fiber  bundle  p  =  ( Q,B,n,G )  is 

an  assignment  to  each  point  q  G  Q  a,  tangent  subspace  Hg  C  TqQ  such  that,  for 
V,  =  {t;  £  TqQ  |  Tir(v)  =  0} ,  we  have 

(a)  TqQ  =  H?  ©  Vg; 

(b)  for  each  g  G  G  and  q  6  Q ,  T?$5  •  Hg  =  B.g.q ; 

(c)  Hg  depends  differentiably  on  q.  | 

In  the  above  definition,  the  subspace  Vg  is  called  the  vertical  subspace  of  TqQ 

and  H q  the  horizontal  subspace.  Figure  4.1.1  gives  a  clear  picture  of  the  definition  of 
connection. 
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Figure  4.1.1  A  Connection  (cf.  [6]) 

From  the  above  definition  of  connection,  the  decomposition  of  the  tangent  space  of 
Q  leads  to  the  decomposition  of  the  vector  fields  on  Q  as  follows.  Let  X  be  a  smooth 
vector  field  on  Q  ,  from  condition  (a)  in  the  above  definition,  at  each  point  q  6  Q  , 

X(q)  =  Xv(q)  +  Xh(q), 

where  Xv(q)  6  V?  and  Xh(q)  €  H?.  From  the  condition  (c),  by  associating  to  each 
point  q  the  tangent  vector  Xv(q)  and  Xh(q),  we  get  smooth  vector  fields  Xv  and  Xh 
which  are  called  vertical  and  horizontal  component  of  vector  field  X ,  respectively.  If 
Xv  =  X  or  Xh  =  X ,  X  is  called  vertical  or  horizontal  vector  field,  respectively. 

From  the  properties  of  horizontal  subspace,  we  see  that  a  connection  defines  a 
subbundle  over  Q  of  TQ  ,  i.e.,  a  (horizontal)  distribution  on  Q .  Moreover,  since  the 
fiber  over  any  point  in  Q  is  differentiably  isomorphic  with  Lie  group  G,  it  is  easy  to 
verify  that  V?  =  {^q(<z)|^  €  G} ,  where  is  infinitesimal  generator  of  action  given  in 
(4.1.1)  with  respect  to  £  €  Q  or  fundamental  vector  field  on  Q ,  and  the  vertical  vector 
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field  Xv  can  be  represented  as,  at  each  point  q  £  Q 


Xv(q )  =  mu*), 

where  £(q)  is  a  choice  of  an  element  in  (7  at  q. 

An  alternative  way  to  define  a  connection  is  as  follows. 

Definition  4.1.4:  Let  (7  be  the  Lie  algebra  of  G .  A  connection  on  a  principal  fiber 
bundle  <p  —  (Q,B,tt,G)  is  a  (/-valued  1-form  on  Q ,  u>  £  ^1(Q;  (7) ,  such  that 

(1)  w(fQ(g))  =  £,  for  any  £  6  Q] 

(2)  (($3)*u;)(X)  =  Adgu{X)  for  any  X  6  X{Q). 

A  (7-valued  1-form  u  satisfying  (1)  and  (2)  is  referred  to  as  the  connection  form.  g 
Remark  4.1.5:  Because  of  the  condition  (2)  in  Definition  4.1.4,  we  say  that  the 
connection  form  is  Ad-equivariant.  If  the  Lie  group  G  acts  on  Q  to  the  right,  the 
condition  (2)  becomes 


((*fl)*w)(X)  =  Adg-iu(X),  VX  e  X(Q).  | 


One  can  show  that  Definition  4.1.3  and  4.1.4  are  equivalent  (cf.  [6]),  that  is, 
given  a  connection  form,  u,  there  exists  an  unique  decomposition  of  TqQ  or  a  choice  of 
horizontal  subspace  H,  =  Ker(u(q))  at  each  point  q  6  Q ;  conversely,  given  a  choice  of 
horizontal  subspace,  there  exists  an  unique  (7-valued  1-form  satisfies  the  conditions  in 
Definition  4.1.4. 

One  should  note  that,  for  a  given  principal  fiber  bundle,  there  exists  an  infinity  of 
choice  of  horizontal  subspace/connection.  This  obvious  claim  turns  out  to  be  important 
in  the  next  chapter  where  we  consider  a  family  of  splittings  of  the  dynamics  of  Lagrangian 
system. 

Let  u  6  zjk(Q;Q).  The  covariant  derivative  of  ui ,  Du  6  wk+l{Q',Q) ,  is  defined 


by 


Du/(X0,  •  •  • ,  Xif)  =  <mx0\...,x£), 


where  Xf  is  horizontal  component  of  vector  field  X,-  €  35{Q)  .  If  u  £  zu1(Q-Q)  is 
the  connection  form  of  a  connection,  the  covariant  derivative  of  u,  =  Du>,  is  called 
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curvature  form  of  the  connection.  We  give  three  important  properties  of  the  curvature 
form  below.  For  proof,  see  [33]. 

Proposition  4.1.6:  Let  u  £  vox{Q\Q)  be  the  connection  form  and  fi  be  the 

corresponding  curvature  form.  Let  X,  Y  be  any  vector  fields  on  Q .  Then,  we  have 

(1)  (($3)*fi)(X,y)  =  Ad3fi(X,F); 

(2)  Structural  equation: 

Q(X,  Y)  =  MX,  Y)  +  [  u(X)MY)  ] ;  (4.1.4) 

(3)  a(X,Y)  =  -u([Xh,Yk]).  I 

If  a  connection  is  determined  on  a  principal  fiber  bundle  p  =  (Q,  B,n,G ),  certain 

types  of  curves  on  Q  can  be  characterized.  A  smooth  curve  {g(t),t  £  [0,1]}  on  Q  is 
called  a  horizontal  curve  if  its  tangent  vectors  are  all  horizontal,  i.e.,  £  H g(t),Vt  £ 

[0,1].  Let  {x(t),t  £  [0,1]}  be  a  piecewise  smooth  curve  in  B.  A  horizontal  lift  of  x(-), 
{q(t),t  £  [0, 1]} ,  is  a  curve  in  Q  which  is  horizontal  and  7r(g(f)))  =  x(t),  Vf  £  [0, 1] .  The 
following  theorem  is  crucial  for  understanding  the  holonomy  or  the  geometric  phases. 
Theorem  4.1.7:  Let  {x(t),t  £  [0, 1]}  be  a  curve  in  B  and  let  q0  be  any  point  in  Q 
such  that  7r(go)  =  x(0).  Then,  there  is  a  unique  horizontal  lift  {q(t),t  £  [0,1]}  which 
starts  at  q(0)  =  go  •  I 

The  proof  of  this  theorem  uses  the  existence  and  uniqueness  of  the  solution  of  an 
ordinary  differential  equation  and  can  be  found  in  [33]. 

From  Theorem  4.1.7,  one  can  define  a  mapping,  called  parallel  displacement , 
if  :  go  =  g(0)  1-+  gi  =  g(l).  It  is  easy  to  check  that  parallel  displacement  is  a  differential 
isomorphism  of  the  fiber  through  g(0)  onto  the  fiber  through  g(l)  and  it  commutes  with 
the  group  action  <f> ,  i.e.,  ^{^(g^q))  —  $(g,^(g))  for  any  g  £  G  and  g  £  Q . 

Consider  a  closed  curve  in  5,  {x(f),f  £  [0,1]}  with  x(0)  =  x(l)  =  xo  .  The  parallel 
displacement  now  is  an  automorphism  of  the  fiber  over  x0 ,  7r-1(xo).  The  set  of  all  of 
such  automorphisms  corresponding  to  all  closed  curves  at  xo  in  B  forms  a  group,  called 
the  holonomy  group  at  xo ,  denoted  by  .  One  shows  that  '&XQ  can  be  identified  as  a 
subgroup  of  the  structure  group  G.  An  element  in  $j;o  is  called  a  holonomy  at  xo . 

Given  a  connection  on  a  principal  fiber  bundle  p  =  (Q,  B,tt,G),  we  now  can 
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compute  the  holonomy  corresponding  to  a  closed  curve  in  B ,  say,  x(-)  =  G  [0, 1]}  , 

and  a  point  go  G  7r-1(a;(0))  C  Q.  Assume  {x(t),t  G  [0,1]}  is  contained  in  an  open  set 
U  of  B.  Let  a  :  U  — +  Q  be  a  local  section  of  the  bundle  and  u>  G  tzl(Q',G)  be  the 
connection  form.  Then  the  holonomy  at  go  with  respect  to  x(-)  is  the  solution  of  the 
following  differential  equation  on  G  at  t  —  1 : 

^21  =  -T.L,  .(»V)(i(t)).  (4.1.5) 

Indeed,  let  g(-)  =  {g(f),f  G  [0,1]}  be  the  horizontal  lift  of  x(- )  with  g(0)  =  go 
and  g(- )  =  {g(t),t  G  [0,1]}  be  a  curve  in  G  such  that  q(t)  =  $(g(t),  = 

$g(t)(<r(x(t))),Vt  G  [0,1].  Then, 

~ ^  =  g(i)Tx(t)<yx(t)  +  Tcr(x(j))$3(j)^Q(cr(x(/))), 

where  £  =  TgLg-\g{t )  =  g(t)-1  -g{t).  Let  us  apply  the  connection  form  to  both  sides  of 
the  above  equation.  Since  g(-)  is  horizontal  and  u  is  Ad-equivariant,  we  have 

0  =  (<r*u ,)(£(<))  +  *.  (4.1.6) 

From  the  definition  of  we  get  Equation  (4.1.5). 

The  (/-valued  form,  cr*u/  G  vs  1(B,G),  is  called  the  local  connection  form.  From 
(4.1.5),  if  G  is  an  Abelian  group,  the  holonomy,  or  g(l),  can  be  represented  explicitly 
as 

g(l)  =  exp(-f  (o’u)(x(t))dt)  =  exp(-  [[  cr'fi),  (4.1.7) 

Jo  J  J  v 

where  V  is  a  surface  in  B  with  z(-)  as  the  boundary  and  fi  is  the  curvature  form  of 
the  connection. 

In  next  subsection,  we  consider  an  application  of  the  theory  introduced  here  to 
simple  mechanical  systems  with  symmetry. 

4.1.2  Mechanical  Connection  and  Related  Control  Problems 

With  the  background  of  differential  geometry  introduced  in  the  preceding  subsec¬ 
tion,  we  now  ready  to  give  the  basic  ingredients  in  formulating  the  problems  of  geometric 
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phases  and  optimal  control  in  mechanics.  Consider  a  simple  mechanical  system  with  sym¬ 
metry  (cf.  Subsection  2.1.4),  ( Q ,  K,  V,  G ) ,  together  with  an  equivariant  momentum  map 
J  :  TQ  — ►  Q*  satisfying 

0  =  K(q)(vq,ZQ(q))  V£  E  Q.  (4.1.8) 

In  addition,  we  also  let  Q  be  the  total  space  of  a  principal  G-bundle  p  =  (Q,B,ir,G). 
Here,  the  base  space  B  is  also  referred  to  as  the  shape  space.  On  this  bundle,  the 
mechanical  connection  is  constructed  as  follows.  At  each  point  q  E  Q ,  define  the  locked 
inertia  tensor  as  the  mapping 

1(g)  :  Q  -*■  Q*  (4.1.9a) 

such  that 

(%K  0  =  k(<i){vq(<i),Zq(<i))  (4.1.9b) 

From  this  definition,  it  is  obvious  that  (I (<7)77,  £)  =  (I(g)£,  v)-  This  map  is  called  as 
the  locked  inertia  tensor  since,  for  coupled  rigid  or  elastic  systems,  it  is  the  moment  of 
inertia  tensor  of  the  relatively  locked/frozen  system  (cf.  Remark  3.5.5  (2)). 

Defining  a  (/-valued  1-form  a  G  w1(Q;Q)  by 
a  :  TQ  — »  Q 

(4.1.10) 

(q,v)  a(q,v)  =  I_1(g)(J(g,  v)), 
one  can  show  the  following  theorem  due  to  Kummer  and  Smale. 

Theorem  4.1.8:  The  (/-valued  one-form  a  in  (4.1.10)  defines  a  connection,  referred 
as  mechanical  connection ,  on  the  principal  bundle  p  =  ( Q,B,k,G ). 

Proof:  We  need  to  check  conditions  (1)  and  (2)  in  Definition  4.1.4.  For  any  u  E  (/*, 

fa  a(^g(g)))  =  I_1(g)(J ($«(?)))) 

=  <J(< £<?(?)), 

=  R(i moAz-'iqMQ) 

=  (I(q)(I-\q)v),  0  =  (v,  0- 

So,  condition  (1)  is  proved.  To  prove  (2),  we  first  show  that  the  locked  inertia  tensor 
has  the  equivariance  property: 


I  (*g{q))-Adgt  =  Adg-1I(q)-Z. 


(4.1.11) 
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For  any  rj  £  Q , 


(I(Zg(q))AdaZ,  V)  =  K(*g(qMAd30Q(*g(q)),VQ(*M)) 

=  /r(g)(Ts.,$J1(^3Og(^3(9)),^;17?Q($3(g))) 
=  ff(?)(6}(?),  {Adg-iTj)Q(q)) 

=  Adg-iq) 

=  77). 

So,  (4.1.11)  is  proved.  Here  we  used  the  relation  (cf.  Proposition  4.1.26  in  [i]) 


Tg-gQg'ivh^giq))  =  (Ad5-i7?)Q(g). 

Now,  let  £  =  (L*a)(q,v) .  Then, 

c  =  a(*a(g),T, *,(«))  =  1(5  •  5)-1  J(5  •  ?,T, *gv)  =  I (g  ■  q)~l  Ad*-X 3(q,v). 


This  implies  1(5  •  q) £  =  Ad*_1J(q,  v).  But,  by  (4.1.11), 


%  •  ?)C  =  %  ■  q)Adg(Adg-iQ  =  Ad I(AViC). 


Therefore,  we  have  J(g,  t?)  =  I(<7)(Ad3-i£)  or  £  =  Adg  1(g)-1  J(q,  v).  Condition  (2)  is 
proved.  | 

From  the  connection  theory  introduced  in  the  preceding  subsection,  when  the 
connection  form  is  defined,  we  have  a  vertical-horizontal  splitting  of  the  tangent  bundle 
TQ .  For  the  mechanical  connection  given  in  (4.1.10),  we  have 


TqQ  =  (Vert)q  ©  (Hor)g 

such  that,  for  each  vq  £  TqQ  , 

vq  =  (a(vq))Q(q)  +  (vq  -  (a(vq))Q(q)) 

=  (i(?)”V)q(?)  +  K  -  (%)~V)q(9)), 

where  /z  =  J(u?) .  It  is  readily  shown  that 


(4.1.12a) 


(4.1.12b) 


Hor  =  {(g,T?)  £  TQ\J(q,v)  =  0),  (4.1.13) 


and  the  splitting  in  (4.1.12)  is  orthogonal  one  with  respect  to  metric  Ii  . 
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To  formulate  the  kinematic  control  problem  explicitly,  we  consider  the  trivial 
bundle,  i.e.,  p  =  (B  X  G,B,ir,G).  Here,  the  control  is  internal  to  the  system,  which 
leaves  invariant  the  conserved  momentum  map  J .  Since,  by  definition,  a  principal  fiber 
bundle  is  locally  trivial,  the  equations  we  have  below  are  locally  true  in  general. 

The  tangent  space  at  each  point  (x,g)  E  Q  is  represented  by 

T(x,g)Q  =  TxB  x  TgG 

and  let  a  tangent  vector  in  T(x,g)Q  be  represented  by  U(x>3)  =  ( vx,v3)(Xtg )  =  (vx,  g-{)(Xig), 
where  £  =  TgLg-ivg  £  Q .  The  Lie  group  G  acts  on  Q  following  the  rule  4>(/x,  (x,g))  = 
( x,hg ),  where  h,g  G  G  and  x  £  B .  Then  the  infinitesimal  generator  corresponding  to 
T]  £  Q  is 

VQ(<l)=  J-  $(exp(eii),(x,g))=  {0,rj-g).  (4.1.14) 

Using  the  (? -invariance  of  K ,  we  have 

J(q,v)-rj=  K(x,g)((vx,g-O,(0,V-  9)) 

=  K(x,e)((vx,£),(0,Adg~iT /)) 

=  Adg-ir))  +  (j{x){vx),  Adg-irf) 

=  (Ad*g- i(J(x)Z  +  j(x)(vx)),  T]), 

where  e  is  the  identity  element  in  G;  I(x)  =  I(x,e)  is  referred  to  as  (local)  locked 
inertia  tensor  at  x,  represents  the  metric  on  G ,  and  j(x)  :  TXB  — >  Q *  comes  from  the 
cross  term  when  the  metric  K  is  written  in  terms  of  metrics  on  B  and  G .  Here,  the 
metric  on  B  is  induced  from  K .  Therefore,  we  have,  for  /x  =  J(g,  v) , 

/x  =  Ad*g-J(x)ii  +  Ad*g^j(x)(vx),  (4.1.15) 

or 

£  =  I(ar)  1Ad*p-  I(ar)  1  j(x)vx, 

or,  by  left  action, 

vg  -  g  ■  (I(z)  1Ad*p-I(x)  1j(x)vx).  (4.1.16) 

Given  a  closed  curve  x(-)  =  {x(t),t  €  [0,1]}  in  B  and  an  initial  point  qo  =  (xo,<7o)  in 
Q,  using  (4.1.16)  one  will  be  able  to  compute  the  shift,  or  the  phase,  in  G.  The  phase 
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generated  by  the  first  term  in  (4.1.16)  is  referred  to  as  a  dynamic  phase  and  the  phase 
generated  by  the  second  term  in  (4.1.16)  is  the  holonomy,  referred  to  as  geometric  phase. 
One  can  show  that  I(x)-1j(a;)(a:)  is,  in  fact,  the  value  of  the  local  connection  form  of 
the  mechanical  connection  at  (z,a:). 

Assuming  that  the  vector  x  or  the  velocity  of  the  path  in  B  can  be  directly 
controlled,  from  (4.1.16),  an  associated  kinematic  control  system  can  be  set  up  as 


x  =  u 

=  1Ad*gfj.-I(x)  1j(x)u), 


(4.1.17a) 


or  simply 

q  =  X^q)  +  H(q)u,  (4.1.17b) 

for  q  =  (g,x)  €  Q ,  where  X^q)  =  (0,5 -I(x)  1  Ad*  p)  is  the  drift,  7 i{q)  :  Tv{q)(B)  -+ 
TqQ  is  the  horizontal  lift  operator  and  u  £  Tir(g)(B)  is  a  tangent  vector  on  shape  space 
representing  controls.  Two  control  problems  for  this  system  can  be  framed  as  follows: 
(PI)  Given  two  points  qo  and  q\  in  Q ,  find  u(-)  steering  qo  to  q\  at  a  specified  time; 
(P2)  Given  two  points  qo  and  51  in  Q  on  the  same  fiber,  find  u(-)  steering  50  to  q\ 
while  minimizing 

[T 

/  <  u,u  >g  dt 

Jo 

for  Riemannian  metric  <  •  >g  on  B  and  the  fixed  final  time  T  >  0  subject  to 


(4.1.17). 

Remark  4.1.9:  One  should  note  that  the  vector  field  given  in  (4.1.17)  is  GM-invariant 
on  the  left,  and  not  invariant  under  the  action  of  the  structure  group  G.  In  addition,  It 
is  obvious  that  for  any  control  law,  the  solution  of  (4.1.17)  satisfies  J{(x,g),(x,g))  =  p. 
This  means  that  one  can  equivalently  construct  the  above  control  problems  on  principal 
fiber  bundle  (Q1  =  B  x  <S>,  B,  x',  GM),  where  7 r'  =  7t|q«.  On  this  bundle,  the  control 
system  (4.1.17)  becomes 


{x  —  u 

9  =  9  ■  (I(*)-V  -  ^(a:)-1^*)^)’ 
which  has  symmetry  under  the  action  of  structure  group  . 


(4.1.18) 
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The  problems  (Pi)  and  (P2)  are  standard  problems  in  control  theory,  namely, 
controllability  and  optimal  control.  (PI)  is  equivalent  to  the  problem  of  accessibility 
and  the  corresponding  condition  is  known  via  Chow’s  theorem  [9].  In  addition,  if  n  =  0 
and  the  system  is  accessible,  (P2)  is  the  isoholonomic  problem  in  [31],  or  a  special  case 
of  the  problem  of  singular  Riemannian/sub-Riemannian/nonholonomic  geodesics  [8,48]. 

In  the  next  section,  we  will  formulate  the  control  system  and  corresponding  optimal 
control  problem  for  the  system  of  a  rigid  body  with  two  oscillators  following  the  above 
procedure. 


4.2  Momentum  Map  and  Connection 

In  this  section,  after  a  complete  derivation  of  the  momentum  map  by  means  of 
standard  method,  we  give  an  explicit  expression  of  the  mechanical  connection  for  the 
system  consisting  of  a  rigid  body  with  two  oscillators. 


Figure  4.2.1  A  Rigid  body  with  two  oscillators 
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The  mechanical  system  we  consider  is  shown  in  Figure  4.2.1.  Here,  r0  is  the 
position  vector  of  the  center  of  mass  of  the  rigid  body  or  carrier  relative  to  the  center 
of  mass  of  the  system;  ri  and  r2  are  the  position  vectors  of  two  oscillators  with  point 
masses  mi  and  m2  relative  to  the  center  of  mass  of  the  system,  respectively;  the  mass 
and  moment  of  inertia  tensor  of  the  carrier  are  denoted  as  mo  and  Io ,  respectively;  Q 1 
and  Q2  are  the  position  vectors  of  two  oscillators  relative  to  a  frame  (not  displayed) 
fixed  on  carrier  and  A  6  50(3)  determines  the  orientation  of  the  carrier  with  respect 
to  an  inertial  frame  (not  displayed).  We  assume  that  no  exterior  force/torque  affects 
the  system  and  the  potential  energy  is  zero.  This  implies  that  the  inertial  frame  can  be 
placed  at  the  center  of  mass  of  the  system  and  ro,ri  and  r2  are  related  by 

2 

^  m^r;  =  0.  (4.2.1) 

t=0 

For  now,  r\  and  r2  (or  Qx  and  Q2)  are  assumed  to  be  arbitrarily  time  dependent 
vectors.  Later,  we  will  impose  constraints  on  them  to  study  the  effect  of  their  motion 
on  the  motion  of  the  carrier. 

From  the  above  setting,  we  have  the  configuration  space  Q  =  (-F3)2  X  50(3)  with 
local  coordinates  q  =  (rj,r2,A)  and  its  tangent  bundle  TQ  =  (T3?3)2  X  T50(3)  with 
local  coordinates  (q,v)  =  ((ri, r2,  A),  (fq,  f2,  Aft)) .  Here,  denoting  by  fl  the  vector  of 
angular  velocity  of  the  carrier  with  respect  to  the  body  fixed  frame,  we  used  the  fact 

A  =  Att  (4.2.2) 


with  the  standard  isomorphism 

K3  -+  so(3) 

/  0  X2  \  (4.2.3) 

{Xi,X2,X3)  ^  z3  0  -Xi  I  . 

\  -*2  X1  0  / 

The  Lagrangian  of  the  system  can  be  determined  easily  as 

"“*-1  1  777,^ 

£((n,r2,j4),(ri,r2,Aft))  =-  <  f2,/0O  >  +-(roj  +  — )  <  fi,ri  > 

2  ,  2  m°  (4.2.4) 

1  .  ros,  .  .  mi  m2  .  . 

+  o(m2  H - )  <  r2>r2  >  + -  <  rl,r2  >, 

2  mo  mo 
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where  <  •,  •  >  is  the  inner  product  in  3f£3 .  This  Lagrangian  is  given  by  a  Riemannian 
metric  K  on  <2,i.e.,  L(q,v)-  ^K(q)(v,v) ,  where,  for  (ux,u2,  Aft)  and  (wx,  w2,  A5)  £ 
T(r-i,r2,A)Q  1 

K(rx,r2,  A)((ux,u2,  Aft),  (wx,  w2,  AE))  = 


ml  s  ml 


<  ft,I0~  >  +(mi  4 - )  <  ux,wx  >  +{m2  4 - -)  <  u2,u;2  >  (4.2.5) 

mo  mo 

mxm2  mxm2 

4 - <  ux,w2  >  4 — — —  <  u2,w i  >  . 


m0 


m0 


(4.2.6) 


Let  G  =  50(3)  act  on  0  by 

$  :  50(3)  x  ((&3)2  x  50(3))  -  (K3)2  x  50(3) 

( A,(r2,r2,B ))  (Arx,Ar2,AB). 

From  (4.2.5),  one  can  show  that  G  acts  on  Q  by  isometries.  Therefore,  by  definition  (cf. 
Subsection  2.1.4),  the  system  ( Q  =  (5?3)2  x  50(3),  K,  V  =  0 ,G  =  50(3))  is  a  simple 
mechanical  system  with  symmetry. 

By  direct  or  some  intrinsic  calculations  on  50(3)  (cf.  [49]),  one  finds  the  Legendre 
transform,  at  (q,  v)  €  TQ  ,  as 


K\q)(v)  =  D2L(q,v)  =  ( AI0Cl,(mx  +  —)rx  + 

m0  mo 


mxm2  . 


ms 


mxm2  . 


ri,{.m2  +  — )f2  + 

mQ  mo 


^i)- 


The  infinitesimal  generator  of  the  action  in  (4.2.6)  corresponding  to  £  £  so(3)  is 


“  * 


$(e^,(r-i,r2,  A))  =  (fri,fr2,fA). 


e=0 


Then  from  (4.1.8),  the  momentum  map  is  calculated  as  follows: 

0  Zq) 


=  {AI0ft,  |A)  +  ((ma  +  —)rx  +  m-----?;2,  £rx) 

m0  "" 


m0 


4-  ((m2  4 - )r2  4 - n,  4^2) 

mo  mo 

,  .  r  _  ,  m? .  .  mxm2 

=  (AI0ft  +  (m  1  4 - ~)rx  X  rx  4 - rx  x  r2 

m0  m0 


,  m2 .  .  mxm2  . 

4-  (m2  4 - -)r2  X  r2  4 - r2  X  rx,0. 

mo  mo 

For  vector  space  TbS0(3)  for  some  B  £  50(3)  ,  pairing  (  •,  •)  is  defined  as 

(  ,  )  :T*50(3)xT50(3)-^K 

1  T 

(Al5  A2)  >->•  -frace(AiA2 ). 


(4.2.7) 


(4.2.8) 
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In  obtaining  (4.2.7),  we  also  used  the  facts: 


xy  =  x  X  y,  Vx,  y  6  5ft3, 


and 

AxAt  =  Tx,  Vx  6  5ft3,  VA  6  50(3). 


Letting  fi  =  J(g,  u),  from  (4.2.7)  we  have 


.  r  ~  /  m? .  .  rnirrii 

u  =  A70fi  +  (mi  4 - -Vi  x  ri  4 - rx  x  ro 

m0  m0 

.  m!,  .  mX7n2 

+  (m2  4 - -)r2  X  r2  4 - r2  X  rx. 

m0  m0 


(4.2.9) 


One  can  show  that  n  is,  in  fact,  the  total  angular  momentum  of  the  system. 

It  is  clear  that  ((5ft3)2  X  50(3))  is  a  trivial  bundle  with  the  structure  group  50(3) 
and  the  base  space  (3ft3)2  coordinatized  by  (Qi,Q2).  Using  coordinates  (QX,Q2,  A)  for 
the  configuration  space  Q ,  the  angular  momentum  in  (4.2.9)  can  be  rewritten  as  follows. 
From  Figure  4.2.1  and  Equation  (4.2.1),  we  have 


ri  =  r  4-  AQi,  i-  1,2 


and 


r  =  -A(exQx  4-  e2Q2), 

where  c;  =  mo+^)+m2 ,  i  =1,2.  Equation  (4.2.9)  can  be  rearranged  as 


H  =  A((/q  4*  A/o)fl  4-  D\Q\  4-  D2Q2 ),  (4.2.10) 


where  _  ^  ^  ^ 

A/o  =  — m(ciQx  4-  ^2 Q\  —  (oQi  4-  ^2 Q2 )2 ) 

D\  =  m[(l  -  €i)eiOi  -  exe202] 


D2  =  m[-ei€2Q  1  4-  e2(l  —  £2)02]- 


By  (4.2.10),  we  have 


fi  =  (/04-  A/0 )~l(ATii  -  (0x0,  +  L»2g2))  (4.2.11) 


or,  by  (4.2.2), 

-4  =  A[(Iq  +  A/q)-1  AT/x  -  (/0  +  A/o)"^©!^!  4-  02Q2)f,  (4.2.12) 
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where  [•]  =  (•).  Comparing  (4.2.12)  with  (4.1.16),  we  see  that 

hockiQiiQi)  =  Io  +  A/o 

is  the  (local)  locked  inertia,  and 

<4Qi>Q2)(QuQ2)  =  [(Io  +  &Io)~\D1Q1  +  D2Q2)] 

is  the  value  of  the  local  connection  form  at  point  ((Qi,  Q2),  {Qi,  Q2))  ETB  with  respect 
to  the  mechanical  connection.  This  connection  form  can  be  explicitly  given  by 

A(<5i,Q2)  =  ITolk^Qi  +  DidQi) 

=  TO^iocfc[((^  ~  ei)ei<2i  —  e\€2Q2)dQi  (4.2.13) 

+  (  —  €1  €2  <5l  +  62(1  —  e2)Q2)dQ2], 
where  the  operator  '  is  the  inverse  of  operator  '“'(cf.  (4.2.3)). 

Equation  (4.2.12)  can  be  used  for  computing  the  phases  of  the  system  and  the 
related  optimal  control  problem  mentioned  in  the  preceding  section.  In  particular, 
when  /x  =  0,  it  can  be  used  to  compute  holonomy,  or  geometric  phase  and  to  solve 
the  isoholonomy  problem.  It  should  be  noted  that,  in  this  case,  the  angular  velocity 
vector  of  the  rigid  body  in  a  body  fixed  frame  is  related  to  the  connection  form  by 

=  —u}(Qi,Q2)(Qi,Q2) 

=  ~  ei)eiQi  —  eie2Q2)Qi  (4.2.14) 

+  (  —  +  €2(1  —  t2)Q2)Q2]- 

In  the  following  sections,  we  will  consider  the  case  fi  =  0  only.  In  addition,  we  will 
assume  that  the  oscillators  are  confined  to  move  along  certain  guide-ways.  Under  this 
assumption,  the  bundle  structure  will  be  simplified  and  the  equations  for  phases  and  the 
connection  form  on  such  a  bundle  can  be  easily  derived  from  those  we  have  found. 


4.3  Planar  System 

We  now  assume  that  the  vector  r,  BQ\  and  BQ2  are  kept  in  the  same  plane  in 
inertial  space  and  that  mi  =  m2  (so  ei  =  e2  =  e).  In  addition,  we  choose  a  body-fixed 
coordinate  system  (or  frame)  0 -xyz  on  the  carrier  with  0 -z  axis  perpendicular  to  the 
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plane  and  the  origin  of  this  frame  is  placed  at  the  center  of  mass  of  the  carrier  (see  Figure 
4.3.1).  We  also  let  the  two  oscillators  move  along  two  parallel  guide-ways  such  that,  in 
the  0 -xyz  frame, 

Qi  =  0)T  and  Q2  =  (  l,x2(t),  0)T, 

where  xi(t),x2(t)  6  5ft  and  l  is  the  distance  of  the  guide-ways  to  the  origin  of  0 -xyz 
frame.  It  is  clear  that  the  configuration  space  is  reduced  to  Q  =  5ft2  x  S1 ,  which  will  be 
coordinatized  by  ( xi,x2,0 ),  and  the  principal  fiber  bundle  is  (5ft2  x  S1, 5ft2,  it,  Sl) . 


Figure  4.3.1  A  rigid  body  with  two  oscillators:  planar  case 


Setting  ^  =  0,  the  angular  velocity  Q.  =  (ftx,  Qy,  fiz)T  in  (4.2.14)  is  of  the  form 


where 


'  Qx  =  0 

<  fty  =  0 

_  •  mel  , , 

0-  =  6  =  - - (xi  —  x2) 

<  flock 


(4.3.1) 


hock  =  I;  +  me(2l2  +  (1  -  e)x{  -  2exix2  +  (1  -  e)xj) 
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with  Iz  the  moment  of  inertia  of  the  carrier  about  2  axis.  It  is  obvious  that  the  local 
connection  form  corresponding  to  the  mechanical  connection  on  the  principal  bundle 
(&2  x  is 


lj(x  1,22)  =  —  7"—  (dxi  -  dx2). 

hock 


(4.3.2) 


For  simplicity,  we  further  assume  that  the  amplitude  of  motion  of  each  oscillator 
is  very  small  in  comparison  with  the  the  spacing  of  two  guide-ways,  i.e. 


\Xi\/l  «  1. 


(4.3.3) 


Under  this  assumption,  using  Taylor  expansion  (up  to  quadratic  terms  of  a;*),  we  get  an 
approximate  u  (with  the  same  notation) 

mel(Iz  +  2  mel2  —  me(2l2  +  (1  —  e)a;2  —  2exia:2  +  (1  —  e)x2)) 


LJ  =  —  - 


( h  +  2  me/2)2 


-{dx  1  —  dx  2). 


The  above  procedure  is  called  localization  in  [23].  Since  we  are  interested  in  the  motion 
of  the  carrier  generated  by  the  motion  of  the  point  masses  on  a  closed  curve  in  shape 
space,  the  above  w  can  be  further  simplified  as  follows.  Applying  exterior  derivative  to 
the  above  equation,  we  have 


du  =  — 


2  mel 


(. Iz  +  2md2)2 

mel 


(xi  +  x2)dxi  A  dx 2 
d(x\dx  2  —  x\dx\). 


(Iz  +  2  mel2)2 

Then,  under  the  assumption  (4.3.3),  we  can  take  (for  closed  paths  in  shape  space) 


LJ  = 


mel 


(Iz  +  2mel2)2 


{x\dx  2  —  x\dx\), 


(4.3.4) 


modulo  an  exact  one-form. 

Let  c(-)  be  any  closed  curve  in  shape  space  3ft2 .  Since  S 1  is  Abelian,  from  (4.1.7), 
the  corresponding  geometric  phase  or  holonomy,  i.e.  the  drift  rate  of  the  carrier  about 
2-axis,  will  be 

=  ~  [w  =  - 1  m  {x'ix*  ~  x'dx')-  <4-3'5> 

Using  (4.3.5),  we  now  compute  the  geometric  phase  for  the  case  in  which  both 
oscillators  follow  sinusoidal  motions  with  different  amplitudes,  frequencies  and  phase 
angles. 
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Let 

x\ (t)  =  a\sin(ujt  -f  <f>i) 

X2  (t)  =  a2sin(nut  +  (fo) 

for  t  e  [0,27r/tD],  where  tD,at-,<^,-  are  real  numbers  and  n  is  an  integer.  Then,  the  closed 
curve  in  the  shape  space  forms  a  Lissajous  figure.  Substituting  the  above  x, (f)  into 
(4.3.5),  we  have 

6Z  =  |  (i,+ 2I1  fh)*  cos((j) 2  -  2<j>\)  if  n  =  2; 

1 0  otherwise. 

Therefore,  n  =  2  is  the  necessary  condition  for  generating  nonzero  geometric  phase 

under  assumption  (4.3.3).  With  this  condition,  <j>2  —  2<t>i  =  2 kir,  for  k  =  0,  ±1,  •  •  •  gives 

the  largest  phase  shift  and  (f>2  ~  2<f>i  =  (2&  +  1)tt/2,  for  k  =  0,  ±1,  •  •  •,  gives  zero  phase 

shift. 

We  next  formulate  the  optimal  control  problem  for  this  particular  mechanical 
system.  For  convenience,  we  re-scale  0  by  the  factor  jj  ■  Then  the  third  equation 

of  (4.3.1)  becomes 

9  —  x\x2  —  x\i\. 


The  optimal  control  problem  is  to  find  control  ui(-)  and  to 


minimize 


/  (uf  +  u\)d.t 

Jo 


subject  to 


'  X\  =  U\ 

{  x2  =  u2 


with  given  boundary  conditions 


9  =  x\u2  —  X%U\ 


(4.3.6a) 


(4.3.6b) 


*i(0)  =  xi(l)  =  *2(0)  =  *2(l)  =  0,  0(0)  =  0O,  9(1)  =  9i.  (4.3.6c) 

Remark  4.3.1:  It  should  be  noted  that  the  difference  of  0(0 )  and  0(1)  should  not  be 
too  large  because  of  the  assumption  on  our  simplified  model.  | 

Since  the  optimal  control  problem  (4.3.6)  has  fixed  boundary  conditions,  one  needs 
to  check  reachability  for  the  control  system  (4.3.6b).  Define  two  vector  fields  on  3i2  X  51 
by 


9i(<l)  = 


1 

0 

2 
X  9 


92(9)  = 
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Then  (4.3.6b)  can  be  represented  as 


9  =  +32(g)«2- 


It  is  easy  to  check  that 

(° 

isu[gu92]\(q)=  o 
\2 

and  51(9),  52(9),  [51,  [gi,g2]](q)  are  linearly  independent  for  any  q  =  (x1,x2,0)  e  ZPxS1 . 
By  Chow’s  theorem,  we  conclude  that,  for  the  system  (4.3.6b),  there  is  an  open  set  about 
the  point  q  =  (0,0,0)  (or  any  other  point  in  §£2  x  51 )  such  that  any  point  in  this  set 
can  be  reached  from  q  by  a  piecewise  constant  input  (ui,u2). 

Theorem  4.3.2:  If  (xi(-),  x2(-),  0(-))  is  an  optimal  trajectory  with  control 

(-u^(-),  u^-))  f°r  the  optimal  control  problem  given  by  (4.3.6),  then  there  exists  A(-)  = 
(Ai(-),  A2(-)>  ^3(‘))T  on  [0>1]  satisfying  the  ordinary  differential  equations 


X\  =  ux 

x2  =  u? 

I  0  =  x\u^  —  x\u{ 


(  Ai  =  —  2X3X1U2 
X2  —  2\2x2u* 
l  A3  =  0, 


(4.3.7a) 


where 


ul(q)  -  ~^(Ai  -  A3®2)  and  u2(i)  =  ^(a2  +  a3^i) 


(4.3.7b) 


with  boundary  conditions 


*i(0)  =  *i(l)  =  *2(0)  =  x2(l)  =  0,0(0)  =  0o,0(l)  =  0i.  (4.3.7c) 


Moreover,  the  system  (4.3.7)  is  completely  integrable. 

Proof: 

The  equations  (4.3.7)  can  be  derived  easily  from  the  Maximum  Principle.  The 
derivation  is  omitted  here.  We  just  prove  solvability.  From  (4.3.7)  one  can  get  differential 
equations  for  the  geodesics: 

'  ah  -  A3(xi  +  22)22  =  0 

<  x2  +  A3(xi  +  22)21  =  0  (4.3.8) 

,  23  +  2(22  -  2i)ii22  +  A3(2i  +  22)(222i  +  2222)  =  0 
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for  some  constant  A3 .  To  integrate  (4.3.8),  let  w  =  x\  +  x2  and  v  =  x\  -  x2  •  We  have 

{ii  =  X3ww 

W  =  —\3WV. 

By  integrating  the  first  equation  and  substituting  the  result  in  second  equation,  we  get, 
for  some  constant  c , 


w  +  X3w(c  +  —ic2)  =  0, 

which  is  the  equation  for  a  quartic  oscillator,  solvable  by  elliptic  functions,  i.e. 

f  dw  _ 

t  =  /  ~7=====tt= 

for  a  =  -^,6  =  -  ,  where  C\  and  C3  are  integral  constants.  Therefore,  we  conclude 

that  if  there  exists  an  optimal  solution  ( q(t),u(t ))  to  (4.3.6),  it  can  be  determined 
explicitly,  i.e.  the  boundary  value  problem  (4.3.6)  is  solvable.  | 

4.4  Three  Dimensional  System 

Starting  from  this  section,  we  assume  that  the  system  is  free  to  move  in  three 
dimensional  space.  Again,  we  assume  that  the  masses  of  the  two  oscillators  are  equal, 
i.e.,  mi  —  mi .  On  the  carrier  a  coordinate  system  0 -xyz  is  set  such  that  the  three  axes 
are  principal  axes,  i.e.,  the  moment  of  inertia  1 0  of  the  carrier  can  be  represented  as 

Iq  —  Dia,g(Ix,  Iy,  fz). 

Two  oscillators  are  allowed  to  move  on  the  carrier  such  that  Q\  and  Q3  satisfy 
Qi(f)  =  (xi(t)cos(ipi),  xi(t)sin(ip  1),  l)T 
Q2(t)  -  ( x2(t)cos(ip2 ),  x2(t)sin(tp 2),  -l)T , 

where  l  >  0  and  for  i  =  1,2  are  constants.  This  means  that  the  two  oscillators  are 
restricted  to  move  along  their  guide- ways  which  are  parallel  to  the  0 -xy  plane  and  are 
at  an  equal  distance  (Z)  from  the  plane  (see  Figure  4.4.1).  The  configuration  space  now 
becomes  Q  =  3ft2  X  50(3)  which  will  be  coordinatized  by  (xi,x2,  A) ,  and  the  principal 
bundle  is  <p  =  (?ft2  x  50(3),  5i2, 7r,  50(3)) . 
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Figure  4.4.1  A  rigid  body  with  two  oscillators:  3D  system 

In  the  above  setting  and  under  the  condition  n  —  0,  the  angular  velocity  of  the 
carrier  in  (4.2.14)  now  is  of  the  form 

H  =  ni(x1,x2)*i  +  ^2(x1,x2)x2,  (4-4.1) 


where 


1 

det(Iiock) 


1 

det(I\OCk) 


(  W21 
I  ^22 


with  det(IiOCk )  and  ,  for  i  =  1,2  and  j  =  1,2,3,  being  polynomials  of  x^  and  x2. 


And,  the  local  connection  form  for  the  mechanical  connection  on  principal  bundle  p  is 


ui  =  -n1(x1,x2)^i  —  Sl2(x\iX2)dx2.  (4.4.2) 

Although  the  above  restriction  on  the  motion  of  the  two  oscillators  simplifies 
the  bundle  structure  of  the  system,  the  expression  of  the  connection  form  is  still  very 
complicated.  In  the  rest  of  this  section  we  will  only  consider  some  problems  with  special 
ipi  and  tp2 . 

An  interesting  question  is  how  to  choose  the  constant  parameters  so  that  the  above 
three  dimensional  system  reduces  to  the  planar  system  discussed  in  preceding  section. 
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A  natural  guess  may  be  that 


0i  =  02  =  0-  (4.4.3) 

But  this  is  not  the  only  condition.  In  fact,  when  (4.4.3)  satisfies,  u>i3  and  u> 23  have 
simple  expressions: 

OJ13  =  -u>23  =  ^e2(4  -  Ix)m2l2sin( 20)(z2  -  Zi). 

Thus,  if  Ix  =  Iy  or  V’  =  0  or  -0  =  f ,  then  ui\2,  =  U22  =  0,  i.e.,  the  rigid  body  will  only 
move  (rotate)  about  the  axis  perpendicular  to  the  plane  formed  by  the  guide- ways  of  two 
oscillators.  Otherwise,  in  general,  the  parallel  motion  of  the  two  oscillators  will  cause 
the  rigid  body  to  drift  about  2  axis.  In  other  words,  when  02  =  02  holds,  a  sufficient 
condition  for  planar  drift  is  that  the  carrier  has  axial  symmetry  about  2-axis,  or  that 
two  point  masses  move  along  the  lines  which  are  parallel  with  the  same  principal  axis 
(0-z  or  0-  2/ )  of  the  carrier. 

Explicitly,  if  ^  =  0,  i.e.,  both  oscillators  move  parallel  to  principal  axis  0-z,  the 
local  connection  form  is 

f0 _ emljdxi  -dx2) _  T 

me(e  —  1  )xl  +  2c2mziz2  +  em(e  —  l)z|  —  2 eml2  —  Iy  ’ 

If  ip  =  j ,  i.e.,  both  oscillators  move  parallel  to  principal  axis  0-y,  the  local  connection 

form  is 

w  / _ emljdx 1  -  dx2) _  Q  Q,T 

v  me(f  —  l)i2  +  2 e2mx\x2  +  em(e  —  l)z2  —  2 eml2  —  Ix  ’ 

The  nonzero  terms  in  above  w’s  are  the  same  as  u  in  (4.3.2)  (up  to  a  choice  of  the 

coordinates  system). 

From  the  above  discussion,  it  is  apparent  that  if  one  is  interested  in  full  three 
dimensional  motion  of  the  carrier  body,  some  skewness  in  the  directions  of  particle 
motions  would  be  necessary.  In  the  following,  we  set  ^  =  0  and  ty2  =  y .  With 
this  setting,  the  constraint  on  the  motion  of  two  point  masses  becomes 

Q\  =  (zi(4,0,  ~l)T  and  Q2  =  (0,x2(t),l)T . 

The  body  angular  velocity  Q.  can  still  be  given  by 

fl  =  fii(zi,  x2)x1  +  fl2(xi,x2)x2 
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for  fix  and  flo  given  in  (4.4.1).  The  explicit  expressions  of  the  components  of  f ,  u>ij, 
are  still  complicated.  We  show  them  in  Appendix  4.A. 

We  now  show  that  the  kinematic  control  system  corresponding  to  (4.1.17)  with 
/x  =  0  is  controllable  in  a  neighborhood  of  (0,0,  A)  for  any  A  £  50(3).  From  (4.2.12) 
with  n  ■=  0 ,  the  kinematic  control  system  is  of  the  form 

'  ii  =  U\ 

-  ±2  =  u2  (4.4.4) 

.  A  —  A(Q \U\  +  Cl2u2)- 

Let  x  =  (21,2:2)  and  q  =  (2,  A)  be  a  point  in  Q .  Equation  (4.4.4)  can  be  represented 
as 

q  =  Xi(q)ui  +  X2(q)u2,  (4.4.4)' 

where  X\{q)  —  ((1, 0),  Afii(2))  and  X2(q)  =  ((0, 1),  A£l2(x)) . 

Again,  we  can  use  the  rank  condition  in  Chow’s  theorem  [9]  to  check  the  control¬ 
lability  of  the  system.  To  this  end,  we  need  a  formula  to  compute  the  Lie  bracket  of 
vector  fields  X\  and  X2  on  3i2  x  50(3),  where  X{  is  represented  as,  in  general, 

Xi(x,A)  =  (Fi(x),  AGi(x))  i  =  1,2  (4.4.5) 

for  smooth  mappings  iq  :  5ft2  — >■  5R2  and  Gi  :  3J2  — »■  K3 . 

Remark  4.4.1:  The  classical  methods  to  express  Equations  (4.4.4),  as  well  as  vector 
fields  in  (4.4.5)  is  to  introduce  Euler  angles  or  other  local  coordinates,  which  are 
very  important  for  numerical  calculation.  However,  such  expressions  are  usually  very 
complicated  and  very  hard  to  use  for  qualitative  analysis.  In  the  following  analysis,  we 
will  use  an  intrinsic  method,  namely,  keeping  the  differential  equation  (4.4.4)  in  matrix 
form  and  using  the  orthogonal  property  for  the  matrices  only.  As  we  will  see  later,  this 
method  will  give  us  a  clean  expression  for  the  necessary  conditions  for  the  optimal  control 
problem  and  make  reduction  of  the  necessary  condition  possible.  For  other  references 
on  the  same  treatment  of  problems  on  modeling  and  control  of  mechanical  systems  on 
50(3),  see  [35,49].  1 

Let  X  be  a  smooth  vector  field  on  a  smooth  manifold  P.  Let  </>(-, p)  :  3?  — »■  P  : 
t  <j>(t,p )  to  be  the  tangent  curve  of  X  at  p.  i.e.,  it  is  a  smooth  curve  in  P  such 
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that  (f>(0 ,p)  =  p  and  ji\t=o<t>{t,p)  =  X(p).  All  of  such  tangent  curves  at  p  forms  an 
equivalent  class,  which  is  denoted  by  [$(t,p)J.  Then,  the  tangent  vector  at  p  can  be 
represented  as 

X(p)  =  l<f>(t,p)] 

and  the  Lie  derivative  of  a  smooth  function  f  on  P  along  vector  field  X ,  Lxf(p)  = 
df(p )  •  X(p),  can  be  computed  by  using  the  equivalence  class  of  curves,  i.e., 


Lxf(p)  =  if  o  <Kt,p) J.  (4.4.6) 

This  expression  is  well  defined  since  /  o  <£(•,  p)  :  3?  — »  3?  for  given  point  p  on  P .  We 
then  have  the  following  result. 

Lemma  4.4.2:  Let  X\  and  X2  be  two  smooth  vector  fields  on  P ,  and  <f>i  and  <j> i  be 
any  corresponding  tangent  curves  at  p.  Then 


lXuX2](f)(p)=jt 


t= 0 


for  any  pgP  and  any  smooth  function  /  on  P . 

Proof :  With  notation  (4.4.6),  in  local  coordinates,  we  have 


(4.4.7) 


LXlLx2f{p)  =  \Lx7f  0  <f>i(t,p)\ 


=  II/ o  &(«, &(*,!>) )11 


dt 

d 

dt 


t= o 


d_ 

ds 


5=0 


D f(<l>i(t,p))  ■  X2{(t>i{t,p)) 


t= o 


=  (DDf(p)-Xl(p))-X,(p)  +  Df(p) 


dt 


A'2(<£i  (t,p)). 


t—0 


Since  (DDf(p)  ■  X\ (p))  ■  X2(p)  =  (DDf(p)  ■  X2(p))  ■  Xi (p),  the  Lie  bracket  of  vector 
fields  X\  and  A'2  is 


[  X2,X2  ](/)(p)  =  LX2LXl(f)(p )  -  LXiLxM)(p) 

=  (x2(Mt))  -  Xi m*Mf)(p) 

t=0 

for  any  p  €  P  and  any  smooth  function  /  on  P .  | 
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Now  we  use  (4.4.7)  to  compute  the  Lie  bracket  of  two  vector  fields  on  3ft2  X  50(3) 
given  in  (4.4.5). 

Proposition  4.4.3:  Given  two  vector  fields  X4  and  X2  on  3ft2  X  50(3)  as  shown  in 
(4.4.5),  the  Lie  bracket  of  X\  and  X2  is  given  by 

[Xl,X2 1  (x,  A)  =  -  | b-fi,  A[ft  x  ft  +  8-^n  -  fum  )  (4.4.8) 

for  any  point  (x,  A)  £  3ft2  x  50(3). 

Proof:  It  is  clear  that  the  tangent  curve  of  vector  fileds  Xi  in  (4.4.5)  at  (x,  4)  can  be 
represented  as 

<Mr)  =  (*  +  rFi(x),  Aexp(TGi(x))). 


Then,  by  (4.4.7), 


[XuX2](x,A)  = 


dr 


(F2(x  +  tFi)  -  F\(x  +  tF2), 


T  =  0 


4exp(rGi)G2(x  +  tFi)  -  4exp(rG2)Gi(x  +  tF2 )) 


■  <i£f>  -  +  iiH  ~  iN  » 

=<HFi“^F2’  a[Gix®2+^ 


dx 


Theorem  4.4.4:  For  system  (4.4.4),  there  is  an  open  set  about  (0,0,4)  for  any 
A  £  50(3)  such  that  any  point  in  this  set  can  be  reached  from  (0,0,4)  by  a  piecewise 
constant  input  (uj ,  u2 ) . 

Proof:  Let 


X3(x,4)^[X1,X2](x,4)=(F3(x),403(x)), 
X4(x,4)  =  [X!,X3](x,4)=  (F4(x),4G4(x)), 
X5(x,A)  =  [X2,X3  ](x,4)  =  (F5(x),4G5(x)), 


where  F,-  and  Gj,  i—  1,2,3  are  computed  by  using  (4.4.7).  It  is  easy  to  see  that 

Fi  =  (o)  fori  = 3'4'5- 

Using  symbolic  calculation  (say,  MACSYMA),  one  can  check  that  in  general, 


dei(G3,  G4,  G5) |r=(o,o)  7^  0. 
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Since  X  50(3))  ~  K5  and  the  vector  fields  shown  in  (4.4.4)’,  which  generate 

the  smallest  involutive  distribution,  have  special  form,  namely,  F\(x)  =  ( 1 , 0)T  and 
Fi(x )  =  (0,1)T,  the  above  equations  are  sufficient  to  show  that  vector  fields  X{  for 
i  —  1,...,5  are  independent.  Consequently,  the  control  system  given  in  (4.4.4)  is 
controllable  by  Chow’s  theorem.  | 

We  now  turn  to  the  optimal  control  problem.  Corresponding  to  (P2)  in  Subsection 
4.1.2,  the  goal  here  is  to  find  ui(-)  and  tr2(*)  to 


minimize 


[  (u2  +  u\)dt 

Jo 


(4.4.9a) 


subject  to 


f  X\  =  U\ 


=  u2 

(  A  —  y4(fij tzi  -f-  Q.2U2) 


(4.4.9b) 


for  given  boundary  conditions 


£i(0)  =  xi(l)  =  £2(0)  =  £2(1)  =  0,  A(0)  =  A0  G  50(3),  A{  1)  =  .4i  e  50(3).  (4.4.9c) 

The  necessary  conditions  for  the  above  problem  is  given  in  the  following  theorem. 
Theorem  4.4.5:  If  (£(•),  4(-))  is  an  optimal  trajectory  with  controls  (wj(-),  u^-))  for 
the  optimal  control  problem  given  by  (4.4.9),  then  there  exist  /r(-)  =  (mi(-)?M2('))  and 
A(-)  on  [0,1]  satisfying  the  ordinary  differential  equations 


,T  ,d£li  , 

Mi  =  -A T(-5~i«i  + 


dxi 


T,d0,i 


u\  + 


d£l2 

dxi 

(?02 


where 


^  ^  dx2  ~l  '  dx2 
(  A  =  A  x  (fljUj  +  02112), 


1  r 

u\{x,  A,n,X)  =  — — (/ii  +  A  0!) 

1  T 

Ui{x,  A,fi,X)  =  --(M2  +  A  02) 


Uo) 

Uo) 


(4.4.10a) 


(4.4.10b) 


with  boundary  conditions 


Xj (0)  =  £j(l)  =  £2(0)  =  £2(1)  =  0,4(0)  =  A0  G  SO( 3),  .4(1)  =  .4T  G  50(3).  (4.4.10c) 
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Remark  4.4.6:  Note  that  the  above  theorem  does  not  depend  on  the  expressions 
of  vectors  and  O2  and  the  dimension  of  the  shape  space  (now  it  is  two)  is  not 

an  important  factor  as  we  will  see  in  the  following  proofs.  Therefore,  the  above 
theorem  is  applicable  to  a  class  of  mechanical  systems  with  product  principal  bundle 


(RTO  X  50(3),  Rm,  7T,  50(3)) .  | 

In  the  following  we  use  two  methods  to  prove  Theorem  4.4.5:  perturbation  method 
(in  variational  principle)  and  geometric  method. 

Proof  of  Theorem  4-4-5  (Perturbation  Method): 

Applying  Lagrange  multiplier,  the  optimal  control  problem  is  equivalent  to  finding 
control  (ui(-), «2(-))  to  minimize  the  cost  function 

J(ui(-),u2(-))  =  /  {ul  +  ul+fii(ui-xi)+ix2(u2-x2)+  (Aa,  A(Q.iUi+fi2u2)-  A))dt, 

J  0 


where  (n\,[i2)  £  T*Xi  ^R2^  R2),and  £  T^50(3) ,  which  are  Lagrange  multipliers, 
{•,•)  is  the  natural  pairing  between  T*50( 3)  and  T50( 3)  defined  in  (4.2.8).  Since 
A^  £  T^50( 3),  it  is  of  the  form  A^  =  AA  for  some  A  6  J}3.  Then  the  cost  function 
becomes 


/(«i(-)>«2(0) 


/: 


H(x1 ,  x\ ,  x2 ,  ®2,  A,  A,  uu  u2 ;  pi ,  /i2 ,  X)dt, 


where 


H  =  u\  +  u22  +  H\{u\  -  x'i)  +  p2(u2  ~  x2)  +  (A, &1U1  +  ft2u2)  -  (AA,  A)  (4.4.11) 


which  is  a  mapping  from  SR2  x  T*(R2  x  50(3))  x  T(R2  x  50(3))  to  SR.  Due  to  the 
presence  of  the  last  term  of  (4.4.11)  we  cannot  directly  apply  formulae  for  the  adjoint 
equations  and  optimal  control  as  appearing  in  most  text  books.  However,  we  still  can 
use  the  idea  of  perturbation  in  variational  principle  for  our  problem. 

First,  consider  the  perturbation  of  function  (4.4.11).  Let 


Cc  =  {(ui(i),u2(*)),*  e  [0,1]} 


be  the  optimal  curve  in  control  space  and 


Cs  =  {(zi(<),x2(*),^(0)>*  £  [0,1]} 
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the  corresponding  optimal  curve  in  configuration  space  5J2  X  50(3).  Let 


Cep  =  {(ui(t)  +  e(f>Ul(t),u2(t)  +  e<j)U2(t)),t  e  [0,1]} 


be  the  perturbed  curve  of  Cc ,  where  e  £  9£,  <^>Ul(-)  and  </>«2(-)  are  smooth  curve  in  Si1 . 
The  corresponding  perturbed  curve  of  Cs  is  presented  by 

Csp  =  +  e<f>Xl(t),x2(t)  +  e<f>X2(t),  Aee*Ait)),t  6  [0,1]}, 


where  <f>Xl( •)  and  <j>x2{-)  are  smooth  curves  in  3J1  satisfying  0Xl(O)  =  <^Xl(l)  =  <j>X2( 0)  = 
4>x 2(1)  =  0  and  4>a(-)  is  a  smooth  curve  in  9ft3  satisfying  ^>>i(0)  =  ^5^(1)  =  0.  It  should 
be  noted  that,  since  exp  :  TeG  — *•  G  for  any  Lie  group  G  is  locally  diffeomorphism, 
Ae€^A  £  50(3)  is  qualified  as  a  perturbation  of  A.  Now  the  integrand  of  perturbed  cost 
function  Je  is 

He  =(ui  +  f  +  («2  +  )2 

+  Ml^l  +  #Ul  -  x-i  -  ej>Xl)  +  M2 (t*2  +  t<t>u2  -  *2  -  *<t>x2) 

+  AT[fl1(x1  +  i<t>xx ,  x2  +  e<f)X2)(u\  +  e<j>Ul) 

+  fl2(*l  +  €<f>xltX  2  +  e<f>x2)(u2  +  €<j>u2  )] 

-  (Ae^AX,Ae^A  +  A-^-(ee*A)). 

By  letting  the  derivative  of  Je  with  respect  to  e  at  e  =  0  be  zero,  i.e.,  ^-|£_0  =  0,  we 
have 


—  /  [2t*i  (t>Ul 
Jo 


d"  2u20u2  d"  MlC^ui  4*xx )  d-  M2(*Au2 


+  A  +  -^-4>x2)ui  +  Cl\4>ui  +  2)U2  +  Cl24>u2 ) 


9a:i  ri  '  (Lr2 
{A<f>A\,  A)  —  (AA,  A<f>A  +  A<j>A))dt. 


dxi 


(4.4.12) 


The  last  two  terms  in  the  integrand  of  (4.4.12)  can  be  re-arranged  as 

(A<pA\,  A)+(A\,  A<Pa)  +  {A\,A4>a) 

=  -tr(A<j>A\AT  +  AXcp^A1^  4-  A T(j>A) 

1  _ _ _  aT.  d 

=  -tr(-<j>AXU  +  A 4>ASl  -  A  <j)A)+  ^(A  •  4>a) 

=  —(A  —  A  x  fi)  ■  4>a  +  ^(A  •  <j>A ), 
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where  9  =  (AT A)  =  fiiiii  +  fl2u2  •  Here  we  used  a  fact, 

9X  -  Afi  =  [  ft,  A  ]  =  9  x  A. 

Also  re-arranging  the  other  terms  in  (4.4.12),  we  get 

0  =  J  [— ^(A  •  4>a  +  Mi  4>XX  +  M2^x2) 

-f  (2ux  +  Mi  +  XT 9\)<j)Ul 

+  (2u2  +  M2  +  Arfl2)^u2 
..  ,'T',d9\  092 

+  (/il  +  A  {o^Ul  +  o^U2)<t>x' 

..  .T/5Qi  5fl2 

+  (/i2  +  A  W*1  +  ~d^U2)<t>x* 

+  (A  -  A  x  9)  ■  (f>A]dt. 


(4.4.13) 


Now,  by  the  conditions  on  0!, ,  and  <f>A  at  t  =  0  and  1,  the  first  term  of  above 
integration  is  zero.  In  addition,  we  can  choose  functions  MiO’/^O)  and  A(-)  on  [0,1] 
such  that  the  coefficients  of  <f>Xl ,  </>l2  and  4>a  to  be  zeros;  this  gives  the  last  three 
equations  of  (4.4.10a).  Finally,  since  the  system  is  controllable  at  (0,0,  A)  (cf.  Theorem 
4.4.4),  the  coefficients  of  0Ul  and  (f)U2  are  also  zero;  this  gives  u J  and  in  (4.4.10b). 

A 


Proof  of  Theorem  4-4-5  (Geometric  Method): 

Here,  we  consider  a  slightly  more  general  form  of  the  problem  (4.4.9).  The  optimal 
control  problem  now  is  to  determine  control  (ui(-),  u2(-))  to 

minimize  /  (u\ u\)dt  (4.4.14a) 

Jo 


subject  to 

'  X  =  fi(x)ui  +  /2(x)u2 

-  _  (4.4.14b) 

A  =  A(fli(x)ui  +  fl2(x)u2) 

with  boundary  conditions 


(x(0),  A(0))  =  (x0,  A0)  and  (x(l),  A(l))  =  (xx,  A^,  (4.4.14c) 

where  (x(i),A(t))  e  0J2  x  50(3),  Vi  E  [0,1],  /,-  :  »2  —  ft2 ,  9i  :  K2  — >•  3?3  for  i  =  1,2 
and  u(-)  -  («!('•),  u2(-))  ’■  [0,1]  — ►  3?2  is  a  piecewise  smooth  function.  Applying  the 
Maximum  Principle  to  this  problem,  we  have  the  following  result. 
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Denoting  by  z  =  ((x,A),(p,AA))  =  ((x,  A),(/j,,  AX))  ~  ((a:,  ,4),  (p,  A))  a  point  in 
T*(5J2  X  50(3)),  we  define  pseudo- Hamiltonian,  H  :  T*(5ft2  X  50(3))  x  5i2  — »•  5i,  by 

H(z,u)  =  <  u,u  >  +  <  p,fi(x)ui  +  /2(x)u2  >  +{Aa,  A(&i(x)ui  +  ft2(x)u2)) 

=  <  U,  U  >  +  <  fl,fi(x)ui  +  /2(x)u 2  >  +  <  A,Di(x)ui  +  fl2U2  >  . 

Define  Hamiltonian 

H(z )  =  min  7i(z,u). 
v  '  «€Si2 

Since  the  control  space  is  unbounded,  one  can  find  the  functions  u*{z )  =  («J(z),  Mjj(z)) 
on  T*(3J2  x  50(3))  with 

UKZ)  =  -4(a*T/ i(*)  +  ATfti(z)) 

<  ‘  (4.4.15) 

«2(*)  =  —  «(/*T/a(«)  +  ATD2(x)) 

Z 

such  that 

1  2 

ff(*)  =  W(x,  «*(*))  =  --  £(<  M,/i  >  +  <  A,fl<  >)2.  (4.4.16) 

t=l 

From  the  Maximum  Principle,  we  know  that  the  existence  of  optimal  control  for  problem 
(4.4.14)  implies  that  there  is  a  solution  to  the  following  system  on  T*(5i2  x  50(3)): 

z  =  XH(z)  (4.4.17) 

with 

r(z(0))  =  (x0,A0)  and  r(z(l))  =  (x1}  ^i), 

where  Xh  is  the  Hamiltonian  vector  filed  with  respect  to  the  Hamiltonian  (4.4.16)  and 
r  :  T*(!R2  X  50(3))  — *  5?2  X  50(3)  is  the  canonical  projection.  Our  goal  now  is  to 
determine  the  vector  field  Xh  on  T*(3?2  X  50(3)). 

Recall  that,  for  given  n -dimensional  smooth  manifold  Q  ,  its  cotangent  bundle  T*Q 
has  a  canonical  symplectic  form  fio  •  Given  a  Hamiltonian  H  on  T"Q  ,  the  corresponding 
Hamiltonian  vector  field  Xh  on  T*Q  is  defined  by 

MXh,Y)  =  H(Y)  (4.4.18) 

for  any  Y  £  X(T*Q).  The  local  expression  of  (4.4.18)  can  be  given  as  (cf.  Theorem 
3.2.10  in  [i]) 

fi0(x,a)((x,a,ei,/3i),(x,a,e2,/?2))  =  (&,ei)  ~  <5i,e2)  (4.4.19) 
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for  (x,a)  e  T*Q  and  (eufc)  €  T{x,a)T*Q ,  i  =  1,2.  In  our  problem,  Q  =  ft2  x  50(3). 
For  the  Hamiltonian  given  in  (4.4.16),  the  corresponding  Hamiltonian  vector  field  Xh 
in  (4.4.17)  will  be  determined  by  (4.4.18). 

Let 

y(t)  =  ((*  +  tv,  Aet2),  (fi  +  t<f>,  He1"  (A  +  0))) 

be  an  integral  curve  on  T*(5ft2  X  50(3))  at  z  —  ((a:,  A),  (/x,  HA))  for  any  v,<j>  E  5R", 
a,/3  E  f?3 .  Then  its  tangent  vector  at  z  is  given  by 

Y (z)  =  ((a;,  A),  (/x,  HA),  (v,  AS),  (<f>,  H(SA  +  $))). 

Now  the  right-hand-side  of  (4.4.18)  can  be  calculated  as 

H(Y)  =  jt\ t=oH(y(t)) 

1  A  2 

=  -7-j-|t=o  'y  {<  +  0,  fi(x  +  tv)  >  +  <  A  +  tf3,Cli(x  +  tv)  >)2 

4  at  f-f 

4=1 

1  2 

=  —T  y,(<  Mi  /i  >  +  <  ^i^*'  >)' 

4=1 

(<  <£,  Zi  >  +  <  M,  Dfi  ■  V  >  +  <  /?,  fii  >  +  <  A,  Dili  ■  V  >) 

=  <  <f>,  flUi  +  /2U2  >  +  <  /3,  S^iTXj  +  0,2^2  ^ 

+  <  ft,  (Df\u\  +  O/2U2)  •  V  >  +  <  A,  (flftrtij  +  T>ft2«2)  •  v  >>  (4.4.20) 
where  u*  is  given  in  (4.4.15)  and 

Dh-iWi  f£)  and  ^  =  (g-  S£) 

for  i=  1,2  are  2x2  and  3x2  matrices,  respectively. 

On  the  other  hand,  let 

Xff(z)  =  ((x,  A),  (/x,  HA),  {w,  He),  (77,  H(eA  +  ?))) 
for  some  vectors  w,t]  €  5R2  and  e,6  €  &3  which  will  be  determined.  Applying  (4.4.19), 
we  have 

{lo(z)(Xff,  Y)  =  <(<£,  H(SA  +  P)),  (w,  He))  -  ((17,  H(eA  +  ?)),  (v,  AS)) 

-<  <j),w  >  +(H(aA  +  P),AT)—  <rj,v  >  -(H(eA  +  6),  AS) 

=  <  </>,  w  >  -  <  r),v  >  +^tr(PTe-6TS  +  AT[e,S]) 

=  <  4>,  w  >  -  <  T],v  >  +  <  (3,e  >  -^tr([e,  A]  +  S)S).  (4.4.21) 
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In  order  to  have  equation  (4.4.18),  we  need  to  make  (4.4.20)  equal  to  (4.4.21).  This  leads 
to  following  choice' of  w,e,r]  and  S. 

w  =  hu\  +  f2u  j 

T)  =  -(Df -  (DOi^  +  DSl2u*2)T  A 

(4.4.22) 

e  =  +  Cl2u2 

6  =  A  x  (fliu*  +  £l2u2). 

With  above  equations,  the  vector  field  X#  can  be  completely  determined. 

In  summary,  the  differential  equation  (4.4.17)  now  has  the  following  form. 

'  i  =  +  f2{x)u2 

A  =  A(fii(xX  +  Q.2(x)u2) 

<  (4.4.23a) 

A  =  -(D/X(a;)u*  +  Df2(x)u;)Tn  -  (D^x)^  +  0ft2(z)t4)TA 

A  =  A  x  (fii(x)ui  +  Cl2(x)u2) 

where 

=  ~\{pT  hix)  +  ATOi(z)) 

<  1  (4.4.23b) 

u2  =  ~~{pTMx)  +  ATft2(z)). 

When  /j  =  (1,0)T  and  /2  =  (0,1)T,  (4.4.23)  leads  to  (4.4.10).  | 

Remark  4.4.7:  From  the  second  proof  of  Theorem  4.4.5,  we  can  see  that  the  geometric 
treatment  of  the  optimal  control  problem  allows  us  to  define  a  Hamiltonian  vector 
field  on  the  manifold  T*(K2  X  50(3)).  The  solution  of  the  optimal  control  problem 
will  correspond  to  a  trajectory  of  this  vector  field.  Of  course,  in  general,  it  is  almost 
impossible  to  find  explicit  solutions  although  we  did  find  it  for  planar  system  as  shown  in 
Section  4.3.  However,  as  we  will  show  in  the  next  section,  identification  of  symmetries  in 
such  a  Hamiltonian  system  will  allow  a  reduction  in  the  order  of  the  system  by  applying 
symplectic  or  Poisson  reduction  theory  which  we  reviewed  in  Chapter  II.  | 


4.5  Symmetry  and  Reduction 

Recall  that  the  manifold  P  =  T*(3£2  X  50(3)) ,  parameterized  by  z  =  (a:,  A,  /j,7  AX) , 
is  symplectic.  Hence,  it  is  also  Poisson  (cf.  Section  2.1).  One  can  verify  that  the  Poisson 
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bracket  of  smooth  functions  F\  and  F2  on  P  is  given  by 

dFx  dF2  dF2  8Fi 
dpi 


{FuF2}p(z)=- 


dx  dpi  dx 

+  {DaFuD^F*)  -  (DaF2,Da~Fx). 

In  the  preceding  section,  we  have  shown  that,  for  Hamiltonian 

,  2 

H(z)  =  --  <  A >)2, 

1=1 


(4.5.1) 


(4.5.2) 


the  Hamiltonian  vector  field  is 


XH(z) 


u0 


A(Q1nJ'  +  n2u5) 


-AT(g-«?  + 
-AT(^«J  +  g*«5) 
A  x  (fljn^  -f-  02^2)  ' 


(4.5.3) 


where 

«!(*)  =  -|(Mi  +  ATfti) 

«a(*)  =  +  ATftj). 

Consider  the  action  of  50(3)  on  3J2  x  50(3) 

$  :  50(3)  x  (3?2  x  50(3))  ti2  x  50(3) 

( B ,  (x,  A))  > — ►  (x,  BA) 
and  its  cotangent  lift  on  T*(5i2  x  50(3)) 

$T*  :  50(3)  x  T*(K2  x  50(3))  -  T*(K2  x  50(3)) 

(B,(x,A,pi,A\))  y-r  (x,  BA,  pi,  BAX). 

The  quotient  space  P  =  T*(3J2  X  50(3))/50(3)  ~  SR2  X  R2  X  3?3  can  be  coordinatized  by 
z  =  (x,/i,  A).  Let  7T  be  the  canonical  projection  from  P  to  P.  We  then  have  following 
theorem: 

Theorem  4.5.1:  The  Hamiltonian  system  (P,  {  ,  }p,Xn)  has  50(3)  symmetry  and 
is  Poisson  reducible.  The  Poisson  reduced  system  is  given  by 


j=  A  (z)XH, 


where  A(z)  is  the  reduced  Poisson  structure  given  by 


(  °  1 

A  (I)=  -/  0  0 

\  0  0  A/ 


(4.5.4) 


(4.5.5) 


98 


for  I  being  2x2  identical  matrix  and  0’s  null  matrices  with  suitable  dimensions,  and 
H  the  reduced  Hamiltonian  given  by 

H  0  7T  =  H. 


In  addition,  the  Casimir  functions  for  the  Poisson  reduced  system  are  all  real-valued 
smooth  functions  of  ||A||2  which  is  a  first  integral  for  the  system,  i.e.,  for  some  constant 

Ci, 

||A||2  =  Ci.  (4.5.6) 

Proof:  It  is  obvious  that  the  Hamiltonian  (4.5.2)  is  invariant  under  the  action 

4>t*  since  it  does  not  have  matrix  A  in  its  expression.  This  immediately  implies 
the  50(3) -symmetry  for  the  system.  Moreover,  the  reduced  Hamiltonian  H  on 
T*(9?2  x  50(3))/50(3)  is  simply 

H(x,(i,\)=  H(x,A,n,A\).  (4.5.7) 

Let  /i  and  /2  be  smooth  functions  on  P  =  T*(312  X  50(3))/50(3) .  Let  F\  and 
F2  be  lifed  functions  on  P  —  T*(3?2  x  50(3))  such  that 

Fi(x,  A,  fi,  AX)  =  fi(x,n,\),  i—  1,2. 

We  need  to  find  the  expression  of  {/i,/2}p  such  that 

{/i ,  /2}p(x, M,  A)  =  {Fx ,  F2}p(x ,  A,  Hi  AX).  (4.5.8) 

where  {Fi,F2}p  is  given  in  (4.5.1).  As  we  have  seen  in  the  second  proof  of  Theorem 
4.4.5,  a  tangent  vector  Y  on  T*(SR2  X  50(3))  at  z  =  (x,  A,/i,  AA)  has  the  form 

Y(z)  =  (v,Aa,w,A(  2A  +  fi))z, 

where  v,w  6  3t2  and  a,/3  6  5R3.  An  integral  curve  of  Y  at  z  can  be  written  as 

y(t)  =  (x  +  tv,  Aeta,fi  +  tw,  Aeta( A  +  tf3)). 


dFi  ■  Y(z)  = 


Then,  by  the  definition  of  F,- ,  we  have 

d_ 
dt 
d_ 
dt 
dfi 


Fi(x  4-  tv,Aeta,H  +  tw,Aeta{ A  +  tf3)) 


t= o 


fi(x  +  tv,H  +  tw,  A  +  tfl) 


(4.5.9) 


f=0 

,  dfi  dfi 

■  •  V  +.  —  •  w  +  —  •  p. 
ox  Oji  o  A 
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On  the  other  hand 


8F-  d  F- 

dFi  •  y (z)  =  -  n  +  (DAFi,Aa)  +  •  w  +  (D A~F{,  A(a\  +  0)) 


dx 

dFi 


dfi 


8Fi 


(4.5.10) 


dx  v+~dji'w  +  ( DAFi " ( DA\Fi )A’ +  (DA\Fi’ A 


Comparing  (4.5.9)  with  (4.5.10),  we  get 


m 

dx 


A 

dx  ’ 


dFj 

dfi 


A 

dii' 


D  ~F-  _  aA 
UA'  A  dx' 


DAF<  =  Adp. 


(4.5.11a) 

(4.5.11b) 


From  (4.5.1),  (4.5.8)  and  (4.5.11),  we  have 

r  f  f  \  !  Adh\  Adh\  /Adh\  Adh\ 

{fi,f2}p(z)  ( Afi\X’Ao\ ) 


+ 


0A"’“0A'  '"0  A 

A.  .A  -  A .  A 

dx  dfi  dx  d/i 


_  .\,A.A_A.A 

^  0A  0A  0a:  dfi  dx  dfi 

,dh. 


(4.5.12) 


\ 


\  Or 


3/x 


£&T)  -7  o  o 

'  A 


3A 


0  7  0 
0  0  A 

where  7  is  the  2x2  identity  matrix.  Therefore,  the  matrix  in  (4.5.5)  is  reduced  Poisson 


structure.  The  proof  of  the  rest  of  this  theorem  can  be  easily  carried  out.  | 

Since  we  are  dealing  with  a  trivial  principal  bundle  here  with  structure  group  as  the 
symmetry  group,  after  recognizing  the  coordinates  for  P  one  should  be  able  to  find  the 
Poisson  reduced  system  by  eliminating  the  third  equation  of  (4.4.10a)  and  determine  the 
reduced  Poisson  structure  from  it.  The  first  integral  in  (4.5.6)  can  be  easily  determined 
from  the  last  equation  of  (4.4.10a). 

Following  Remark  4.4.6,  it  is  clear  that  this  reduction  theorem  does  not  depend 
on  the  forms  of  vectors  Oi  and  0,% ,  either.  It  can  also  be  extended  to  a  system  on  a 
principal  bundle  (3J”1  x  5’0(3),  5Rm,  7r,  SO(3)) . 


4.6  Approximation  and  Further  Reduction 

Up  to  now,  we  have  made  no  simplifications  or  approximations  of  the  optimal 
control  problem.  It  is  customary  to  make  ad  hoc  approximations  in  the  interest  of 
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ensuring  analytic  integrability  or  numerical  solvability.  However,  in  the  process  one  can 
easily  destroy  symmetries  inherent  to  the  problem.  This  is  highly  undesirable.  On  the 
other  hand,  in  many  physical  systems,  simplification  and/or  approximation  may  bring 
symmetries  to  the  system.  Our  purpose  in  this  section  is  to  impose  suitable  assumptions, 
explore  further  symmetry  and  reduce  the  order  of  the  system  again. 

As  having  done  in  Section  4.3,  we  assume  that  the  distances  of  the  point  masses 
to  0 -z  axis,  X{ ,  are  very  small  in  comparison  with  the  distances  of  them  to  0-xy  plan, 
l,  i.e., 

1**1//  <<  1.  (4.6.1) 

By  doing  so,  we  ignore  the  higher  order  terms  with  (^•),(^'L)-?  for  i  +  j  >  2  in  both 
numerators  and  denominators  of  fii(xi,x2)  and  f22(x!,x2).  Then  the  approximate 
angular  velocity  (using  the  same  notation)  of  the  rigid  body  becomes 


where 


for 


/  0  \  /  a2  \ 

fl  =  I  a2  J  i\  +  I  0  J  x2  (4.6.2) 

\&ix2  )  \boXi  J 

a\  =  —-^emlzl(2eml2  +  Ix ) 

—  -i-e2m(2em/2  +  Ix)(2eml2  —  ml 2  +  Iy ) 

(4.6.3) 

a2  =  ——emlzl(2eml2  +  Iy ) 

&2  =  —  -^e2 m(2eml2  +  Iy)[2eml 2  -  ml 2  +  Ix) 


A  =  Iz(2eml2  +  /r)(2em/2  +  Iy). 


Remark  4.6.1:  From  (4.6.2),  we  see  that  if  second  point  mass  is  fixed  on  0 -z  axis,  i.e. 
x2  =  0,  an  infinitesimal  linear  motion  of  first  point  mass  along  0-x  axis  will  generate 
infinitesimal  rotational  motion  of  rigid  body  about  0 -y  axis  only.  However,  if  there  is 
an  small  offest  of  second  masses,  i.e.  x2  ^  0,  an  infinitesimal  linear  motion  of  first  point 
mass  along  0-  x  axis  will  generate  infinitesimal  rotating  motion  of  rigid  body  about  both 
0 -y  axis  and  0 -z  axis.  Therefore,  the  above  assumption  does  make  physical  sense.  l 
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Inspired  by  the  symmetric  heavy  top,  we  pose  one  more  important  assumption. 
We  assume  that  the  rigid  body  is  symmetric  about  0-  z  axis  which  implies 


lx  —  ly 


(4.6.4) 


Under  this  assumption  Q,  takes  the  form: 

(  °\  (a\- 

ft  =  I  a  I  £1  +  I  0  I  %2, 

\  bxz  )  \-bxiJ 

where  a  =  a\  and  b  =  b\.  The  approximation  of  H  is,  for  z  =  (i,/x,  A), 
H\2  (•?)  =  —  —  ((6A3X2  +  Mi  +  0.X2)2  +  (bX^Xi  —  M2  —  a^i)2)* 
Applying  H\2  to  (4.5.4),  we  get  corresponding  Poisson  dynamics 


(4.6.5) 


(4.6.6) 


z  =  A(2)  V  H\2 


or,  explicitly, 


*1  =  - 


6A3X2  +  Ml  "b  (1A2 


x2  = ' 


Ml  = 


M2  =- 


6A3X1  —  M2  —  ttAi 
2 

b2 A3X1  —  6A3M2  —  0&A1A3 
2 

b2X 3X2  +  &A3M1 +  abX2X3 


Ai  =  -  —  (b2 X2X3X2  +  (&A2M1  —  (26A3  +  abX^)x2  +  b  A2A3XJ 
+  (— 6A2M2  —  a6A1A2)x1  —  aA3Mi  —  a2  A2A3) 

A2  =  ~(62  A1A3X2  +  (&A1M1  +  abX\X2)x2  +  &2AjA3X2 


A3  = 


-p  ( — bXifj-2  ~b  G&&A3  —  ufrA^xi  (ZA3M2  ^  ^1^3) 
abXiX^X2  +  a6A2A3Xi  —  aA2M2  +  <*AiMi 


(4.6.7) 


Remark  4.6.2:  A  standard  way  to  approximate  Hamiltonian  system  is  to  keep  some 
terms  of  Taylor  expansion  of  the  Hamiltonian  up  to  a  certain  order.  Our  approximation 
above  keeps  0th,  1st  and  part  of  2nd  order  terms  of  Taylor  expansion  of  H .  Its  physical 
meaning  is  given  in  Remark  4.6.1.  I 
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Next,  we  show  that  the  above  system  admits  a  symmetry  group  and  the  order  of  the 
reduced  Hamiltonian  system  can  be  finally  brought  down  to  4.  Consider  a  one-parameter 
group  Gt  —  S 1  with  each  element  having  the  form: 

gT  =  BlockDiag(Rot(r),  Rot(r),  Rot3(—r)), 

where 

/  M  f  cos(t)  sin{r ) 

Rot(r)  =  (  cos('r)  •sjn(r)  j  and  Rot3(T)=  -sin(r)  cos(r) 

V  '  V“5in(r)  cos{t))  l  0  0 

Define  an  action  of  GT  on  P  —  T*(K2  x  50(3))/50(3)  ~  3?7  by 

:  GT  x  §K7  ->  &7 

(r,  (a;,  /x,  A))  (iEot(r)a:,  Eot(r)/x,  #ot3(-r)A). 

We  then  have  following  striking  fact. 

Theorem  4.6.3:  Following  the  rule  of  (4.6.9),  the  group  Gr  acts  on  the  Poisson 
manifold  P  canonically,  i.e.,  for  any  gT  €  Gr, 

{A,/2}p  o  v3r  =  {A  o  *gr,f 2  o  *aT}p>  v/i  e  c°°(P),vffT  e  Gr. 

In  addition,  the  approximate  Hamiltonian  Hu  is  Gr -invariant,  i.e. 

Huiz)  =  Hu^gr{z)). 

Proof :  The  first  assertion  is  equivalent  to 

D9ar(z)A(z)D*gr(z)T  =  A(¥,r(2)). 

This  can  be  shown  by  a  straightforward  calculation.  So  is  the  second  assertion.  | 

Remark  4.6.4:  From  this  theorem,  one  immediately  concludes  that  GT  is  a  symmetric 
group  of  the  system  (4.6.7).  ■ 

It  is  clear  that  by  Theorem  4.6.3  the  conditions  of  Noether’s  theorem  introduced  in 
Section  2.1  are  satisfied  and,  consequently,  the  associated  momentum  map  is  an  integral 
of  the  reduced  system  (4.6.7).  From  Section  2.1  we  know  that  the  momentum  map 
J  :  P  — ►  Q*  (the  dual  of  Lie  algebra  of  GT )  of  this  action  is  defined  by 

<J(3?),0  =  J(0(5)  (4-6-10) 


(4.6.8) 


0 

0 

1 


(4.6.9) 
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(4.6.11) 


for  all  £  6  QT  and  z  6  P ,  where  J  :  QT  — ►  C°°(P)  is  a  linear  map  such  that 

—  £p- 

Theorem  4.6.5:  The  constant  momentum  map  for  the  system  (P,  {  ,  }^)  correspond¬ 
ing  to  the  action  ^  of  Gr  on  P  is 


.7(0(2)  =  -*iM2  +  *2^1  +  A3. 


(4.6.12) 


Proof: 


Recall  that  the  Hamiltonian  vector  field  Xh12  on  P  is  given  by 


**„(/)  =  {/.ffwh 


V/  6  C°°(P), 


and  from  (4.6.11),  we  can  determine  the  momentum  map  by 


{/, -7(0}*  =  *»(/) 


V£  e  gT. 


From  (4.6.8)  we  know  that  £  €  (7T  is  of  the  form 


f  =  «'^°“»((-l  o)’(-i  o)'(j  ®  “))• 

where  £'  is  any  constant  in  3?  which  will  be  chosen  as  1  later.  It  is  easy  to  show  that 
the  infinitesimal  generator  of  the  action  (4.6.9)  corresponding  to  £  is  given  by 


-  S 


tf(exp(£r),z)  =  (x2,  -xi,p2,  -Pi,  -A2,  A1?  0)T, 


T  =  0 


where  $  is  defined  in  (4.6.9).  Then,  for  any  function  /  on  P, 


df 


df 


df 


df 


df 


df  tndf 


f?(/)(3 0  =  -  xii±  +  M2  -  Mi  Hr  ~  A2^-  +  Xrjfc  +  0^-.  (4.6.13) 


dx i  dx2  r  dpi  ^ldfi2  dXi 

On  the  other  hand,  let  /(£)  be  a  function  on  M ,  then 


d  A3 


{/, -7(0X2)  =df(J)TA(T,dJ{a Z) 


JJ{Q  df  |  aj(Q  a/  aj(Q  3/  MO  df 

djj,\  dx i  dfi2  dx 2  dxi  dn\  dx 2  dfi2 


dm  ,dJ(0,df 


dm  ,  ,  0-7(0,  0/ 


,  “^vs/  ^  ~<'vs/,  f  \  vs/  ,  >  ^"vs  /  ,  / 

+  (A3~aAT  ~  ^-dxT’ax:  +  (“A3“9Ar  +  Al“9AT)aA; 


,  /x  a/(£)  x  a/(0,9/ 
+  (A2“9AT“'Al"5Ar)aA;- 


(4.6.14) 
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Comparing  (4.6.13)  and  (4.6.14),  we  have  following  PDE 


m) 

dni 

om 

dfJ.2 

dm 


=  x2 

=  ~X1 

=  M2 


d_m 

dx2 


A3 

—  A3 
A2 


d_m 

d\2 

e_m 

d\\ 

d_m 

8X2 


=  -Ml 
=  — A2 

=  Ai 
=  0. 


One  can  check  that  function 


J(t)&  =  ~xlfl2  +  x2^\  +  A3 

is  a  solution  of  the  above  PDE.  Therefore,  from  Noether’s  theorem,  this  function  is  a 
constant  along  the  motion  of  Xh12  ,  i.e., 


-X1M2  +  Z2M1  +  A  3  =  C2 


(4.6.15) 


for  some  constant  C2  .  | 

Since  the  reduced  system  has  GT -symmetry,  by  using  standard  Poisson  reduction 
procedure  again,  we  can  drop  the  system  (4.6.7)  to  quotient  space  P  =  P /Gr  — 
T*(5P2  x  SO(3))/ SO(Z)/S1  with  projection  t  :  P  — ►  P.  In  the  following,  we  will 
find  induced  Hamiltonian  H12,  induced  Poisson  structure  A  and  reduced  Hamiltonian 
vector  field  Xr12  on  manifold  P . 

First,  consider  a  change  of  coordinates 

ip  :  P  —*  P 


z  =  (a;1,*2,Mi,M2,  Ai,  A2,  A3)  z'  =  (n, r2, r3, Bu  02, 03,  A3) 
given  by  relations 


(4.6.16a) 


'  xi  =  ricos(0i), 

Ai  =  r3cos(d3), 

x2  =  r1sin(01), 

,  <  A2  =  -r3sin(d3),  (4.6.16b) 

Hi  =  r2cos(#2), 

A3  =  A3. 

.  M2  =  r2sin(02), 


105 


With  this  set  of  new  coordinates,  the  Hamiltonian  H\2  becomes 


H'  12(2')  =^(2 ar2r3sin(93  -  62)  +  2a6A3rir3co.s(03  -  Oi) 
+  2bX3rxr2sin{92  -61)-  a2r\  —  r\  —  b2Xlrl). 


(4.6.17) 


And  the  corresponding  dynamics  Xh>12  is  giyen  by 


r  1  =- 


0i=- 


r2  = 


ar3sin(63  -  9\)  —  r2cos(02  —  61) 


ar3cos(93  -  9X)  +  r2sin{92  —  0\)  -  bX3rx 

2  f! 

abX3r3cos(83  —  d2)  —  b2  X2rxcos(92  —  9\) 


92  =-  - — (abX3r3sin(93  -  62)  +  b2  X23rxsin(92  -  9X)  -  bX3r2 ) 
2r2 


r  3 


aX3r2cos{93  —  92)  —  abX3r\sin(93  —  9X) 


(4.6.18) 


1 


93  =  -  -2—(aX3rosin(93  -  92)  +  (abXlrx  —  abrxrl)cos(93  -  9X) 
2  r3 


A3  =  — 


-  br\T2r3sin{92  -  0i)  +  {b2X3r\  -  a2 X3)r3) 
ar2r3cos(93  -  92)  -  abX3rxr3sin(93  -  9X) 


Observing  that  the  Hamiltonian  H'\2  and  the  right-hand-side  of  differential  equation 
(4.8.18)  depend  on  relative  values  of  9X ,  92  and  63  only,  we  can  reduce  the  order  of  this 
system  as  follows.  Let 


@2i  =  92  —  9i 


832  —  93  —  92. 


By  using  2  =  (ri,r2,r3,62i,032,  X3)  to  parameterize  P,  the  induced  Hamiltonian  on  P 
is  given  by 


H\2{z)  -~{2abX3rir3cos{932  +  02X)  +  2ar2r3sin(032) 


(4.6.19) 


4-  2bX3r\r2sin{62\)  —  a2r\  —  r\  -  b2 X\r\ 


)• 
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The  corresponding  induced  dynamics  X#io  on  P  is  given  by 

ar3sin(6 32  +  02\)  -  r2cos(92X) 


ri 


r2  =  - 


abX3r3cos(032)  -  b2  X3rxcos(92X) 


a&A^rxsin(032  +  021)  -  aX3r2cos(032) 

r3_  _ 


^21  =' 


-{ar2r3cos{932  +  02X)  -  abX3rxr3sin(932) 


832  — 


= 


2rir2 

+  (■ r\  -  b2 Xlrl)sin(02x)) 

~-{{abrxr2rl  -  a&A^rir2)cos(032  +  021) 
zr2r3 

+  (a6A3r3  -  aX3r\)sin{832) 

+  ( brxr\  +  b2  Xlri)r3sin(92i) 

+  ((a2  -  6)A3  -  &2A3r2)r2r3) 
abX3rxr3sin(932  +  92x)  -  ar2r3cos(932 ) 


Moreover,  the  first  integrals  in  (4.5.6)  and  (4.6.15)  now  take  form: 


(4.6.20) 


r3  +  Ao  =  Cx 


(4.6.21) 


and 

rxr2sin(92x)  +  X3  =  C2.  (4.6.22) 

Therefore,  as  we  claimed  before,  the  final  reduced  system  with  the  above  two  integrals 
is  a  four-dimensional  Hamiltonian  system. 

Next,  we  show  that  the  final  reduced  system  (4.6.20)  with  (4.6.21)  and  (4.6.22)  is 
also  Poisson.  In  other  words,  one  should  be  able  to  find  a  Poisson  structure,  A,  on  P 
such  that  equations  (4.6.20)  can  be  expressed  as 

I  =  A  (z)VHX2(z). 

To  this  end,  we  first  look  at  the  Poisson  tensor  A  under  the  new  coordinates  z'  — 
(rx,r2,r3,9x,92,93,X3)  in  P.  Since  zr  =  i>(z)  for  z  =  (xx,  x2,  /ux,/i2,  Xx,  X2,  X3) ,  the 
Poisson  bracket  under  new  coordinates  will  satisfy 

{Fi,F2}(z')  =  {fx,f2}  (4.6.23) 
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where  Fj  =  /,  o  for  fi  G  C°°(P),  i  —  1,2.  The  right-hand-side  of  (4.6.23)  gives 


It  is  easy  to  show  that 


dfj  _  _  <W± 

dz  '  dz’  dzr 


Therefore,  the  Poisson  tensor  under  new  coordinates  is  given  by 

h.'{z')  =  (•^)l7=^-l(7«)A(V,  1(z'))(~q~)T  l7=V'-1(«')' 
From  (4.6.16),  one  can  show  that 


COs(0i) 

sin(Oi) 

0 

0 

0 

0 

°\ 

0 

0 

cos(02) 

sin(d2) 

0 

0 

0 

0 

0 

0 

0 

cos(83) 

-sin(d3) 

0 

sin($i ) 

TX 

cos(fll) 

r\ 

0 

0 

0 

0 

0 

0 

0 

sin($2) 

ri 

cos(  81) 
ri 

0 

0 

0 

0 

0 

0 

0 

sin(83) 

COS(03) 

*•3 

0 

0 

0 

0 

0 

0 

0 

1/ 

Then 


A'(z')  = 


V 


0 

cos (9 2  - 

-0i) 

0 

0 

sin(02~  ^l) 

0 

0 

\ 

cos(02  -  #i) 

0 

0 

sin(fl2— 81 ) 

ri 

r2 

0 

0 

0 

0 

0 

0 

0 

0 

Aa 

0 

0 

sin(02- 

^1) 

0 

0 

COS(02— *l) 

r3 

0 

0 

sin(«2  —  8X) 

r\ 

0 

0 

cos(02  — ®l) 

r\ri 

0 

0 

0 

ri 

\ 

nr2 

0 

1 

0 

0 

_  Aa 

0 

0 

0 

0 

^3 

0 

0 

0 

-1 

0 

/ 

(4.6.24) 


which  depends  on  <?2  —  #1  only. 

We  now  ready  to  determine  the  Poisson  tensor  on  P .  Let  j\  and  f2  be  smooth 
functions  on  P  which  is  parameterized  by  z  =  (ri,  r2,  r3, #21,  #32?  ^3)  -  Let  f\  and  f2  be 
lifed  functions  on  P  such  that  at  lifted  point  7f-1(z)  =  z  =  (ri,  r2,  r3,  Q\,  02,  #3>  A3)  in 
P 

fi(z)  =  fi(z ),  *  =  1,2. 

The  Poisson  bracket  of  f\  and  f2  at  any  point  z  in  P  is  given  by  Poisson  structure 
shown  in  (4.6.23).  We  need  to  find  the  expression  of  {/1,  h}p/G  such  that 

{h,h}p/Gr  oir  =  {7i,/2}p.  (4.6.25) 
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It  is  easy  to  show  that 


/  fZ\ 

or  i 

M. 

9r2 

*L 

®r2. 

IL 

dex 

M. 

dez 

1L 

9^3 

\1L  I 

V  9A3  ' 


(30* 


/I  0  0 
0  10 
0  0  1 
0  0  0 
0  0  0 
Vo  o  o 


0 

0 

0 

-1 

1 

0 


0 

0 

0 

0 

-1 

0 


o°^ 

0 

0 

0 

1/ 


(  §k\ 

1L 

dr 2 

IL 

9r3 

JLL 

d$2i 

IL 

903  2 

\  IL  ) 

V  9A3  7 


or  shortly, 


3J_  _  „df 

dz  ~  dz ' 


Then,  immediately,  from  (4.6.23)  and  (4.6.24),  we  have 

U1J2  }p/GSz)  = 


where 


A(2)  =  ET(Aofr  l)(z)Z 


/ 


0 

— cos(02l) 
0 

) 

r2 

sin(62 1) 
r2 
0 


COs(#2l) 

0 

0 

sin(&2 1 ) 
n 
0 

0 


0 

0 

0 

0 

0 

.  hi 
r3 


sin(B2x) 

T2 

sin( 92i ) 
r\ 

0 

0 

cosC^) 

r\r2 

0 


(*) 


sin(6 21) 
^2 
0 

0 

COs(fl2l) 

rl  r2 
0 


0 

0 

Ai 

r3 

0 

0 

0 


\ 


Therefore,  (4.6.20)  can  be  expressed  by. 


I  =  A  (z)VH12(z) 


for  z  E  P. 
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Appendix  4. A: 

This  appendix  gives  complete  expression  of  fti(xi,£2)  and  ^2(^1, ^2)  in  (4.4.1) 
with  fa  =  0  and  fa  —  ^/2. 

where 

det(Iiock)  =(e  -  l)e3(2c  -  l)m?x\x\ 

+  (e  -  l)e2m2(2€2mg2  -  emq2  +  ely  -  Iy)x\ 

+  (e  -  l)e3(2e  -  l)m3x42;2 
+  e2m2(4e3mq2  -  10 e2mq2  +  4emg2  -  2 elz  +  Iz 

+  €2  Iy  —  2  e.Iy  +  ly  +  €2  lx  ~  2  tlx  +  Ix)X\X2 
+  em(4e3 m2qA  -  2 e2m2qi  +  2 e2 1  zmq2  -  2 elzmq2 
+  2e2 1 ymq2  -  2 elymq2  +  2 e2Ixmq2 

—  elxmq 2  +  elylz  —  lylz  +  elxly  ~~  Ixly)x2 
+  (e  -  l)e2  m2  (2e2  mq2  -  emq2  +  elx  -  Ix)x\ 

+  em(4e37nV  -  2 e2m2qi  +  2 e2Izmq2  -  2 elzmq2 
+  2 e2Iymq2  -  elymq2  +  2e2Ixmq 2 

—  2elxmq2  T  elxlz  Ix^z  d*  e.lxly  IxIy')x\ 

+  Iz(2emq2  +  Ix)(2emq2  +  Iy ) 

wu  =  -  €4(2e  -  l)TO3gxiX3 

-  e3m2q(2emq2  -  mq 2  —  Iz  +  Iy)xix2 

u\2  =  -  (t  -  l)€3(2e  -  l)m3gx4  +  e3(2e  ~  l)m2<?x2X2 

+  e2  m2q(4e2mq2  -  2 em2q2  +  elz  -  Iz  +  2elx  -  Ix)x\ 

+  e2mq(2e2mq2  -  emq 2  +  elx  -  Ix)x\ 

-  emlzq(2emq2  +  Ix) 
u;13  =  -  e4(2e  -  l)m3x2x3 
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-  (e  —  l)e3  m2  (2emq2  —  mq2  +  Iy)x  2 

-  e3m2(2e2mq2  —  emq 2  +  elx  -  Ix)xjx2 
+  e2m{2emq2  +  Ix)(2emq2  -  mq 2  +  Iy)x 2 

cj2i  =e3(2e  -  l)m3 qx\x\ 

+  e2m2q(2e2mq 2  -  emq2  +  ely  -  Iy) x\ 

-  (e  -  l)€3(2e  -  \)m3qx\ 

■f  e2m2q(4e2mq2  -  2 emq2  +  elz  -  I2  +  2 ely  —  Iy)x\ 

-  emlzq(2emq 2  +  Iy) 
cj22  =  -  e4(2c  -  1  )m3qx\x2 

-  e3m2q(2emq2  -  mq2  -  Iz  +  Ix)x\x2 
o>23  =e4(2e  -  1  )m3x\x\ 

+  e3m2(2e2mq2  —  emq2  +  ely  -  Iy)x\x\ 

+  (e  -  l)e3m2(2emq2  -  mq2  +  Ix) x3 

-  e2m(2emq2  +  Iy)(2emq2  -  mq2  +  Ix)x i- 
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CHAPTER  V 


CHAPLYGIN  DYNAMICS  AND 
LAGRANGIAN  REDUCTION 


In  this  chapter,  based  on  the  intrinsic  form  of  Lagrange-d’Alembert  principle 
formulated  in  Chapter  II,  we  study  constrained  Lagrangian  dynamics  with  symmetry 
on  principal  fiber  bundles  with  connections.  We  show  that,  under  certain  constraints 
which  are  of  the  nonholonomic  variety,  there  is  a  family  of  splittings  of  the  Lagrangian 
velocity  phase  space.  Each  splitting  gives  a  Lagrangian  dynamics  on  the  horizontal 
distribution  given  by  a  connection.  If  the  nonholonomic  constraint  is  given  by  the 
horizontal  distribution  of  the  connection  itself,  our  result  leads  to  Koiller’s  formula 
for  non- Abelian  Chaplygin  systems  [20].  If  the  mechanical  connection  is  used  and  the 
exterior  force  leaves  the  momentum  map  invariant,  our  result  leads  to  (non- Abelian) 
Lagrangian  reduction  due  to  Marsden  and  Scheurle  [27]. 

In  Section  5.1,  as  a  motivation,  we  first  consider  a  simple  class  of  systems,  namely, 
Lagrangian  systems  with  Abelian  symmetry  and  affine  constraints.  An  important 
observation  from  studying  such  systems  is  that,  when  a  constrained  system  possesses 
symmetry,  the  dynamics  of  the  system  can  be  described  by  a  reduced  system  of  lower 
dimension  and  without  constraints.  In  Sections  5.2  and  5.3,  we  show  that  for  a  system 
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with  non- Abelian  symmetry  such  a  reduction  can  also  be  obtained  by  going  through  a 
two-step  procedure.  In  Section  5.4,  we  demonstrate  Lagrangian  reduction  by  using  the 
formulas  generated  in  Sections  5.2  and  5.3.  The  first  section  of  Chapter  IV  provides  some 
of  the  mathematical  background  (principal  bundle,  connection,  etc.)  for  this  chapter. 


5.1  Abelian  Chaplygin  Systems  with  Affine  Constraints 

Let  Q  =  X  be  the  n -dimensional  configuration  space  of  the  system 

under  study,  and  q  =  (qi,q2)  a  point  in  Q ,  for  q\  £  and  q2  £  3in-m .  Let  function 
L  :  TQ  — >  5?  be  the  Lagrangian  of  the  system  with  the  property: 

L(q,v)  =  L((qi,0),(v1,v2))  =  L(q1,v1,v2),  (5.1.1) 

where  v  —  (v\,v2)  £  TqQ  ,  for  v\  £  Tqi$lm  and  v2  £  Tq2Stn~m  .  The  constraint  on  the 
system  is  given  by  the  zero  level  set  of  m  linearly  independent  functions  /  :TQ  — >  9fJm 
which  is  of  the  form 

f(q,v)  =  B(q1)v1-v2  +  b(q1),  (5.1.2) 

where  B  :  —>  L(5RTn, 3Jn-7n)  and  b  :  are  smooth  mappings.  In  addition, 

the  exterior  force  or  control  is  given  by  a  mapping  F  :TQ  — ►  T*Q  which  is  of  the  form 

F(q,v)  =  (F1(qi,v1,v2),F2(qi,v1,v2)),  (5.1.3) 

for  Fi(qi,  t>i,  v2)  £  Tqi^lm  and  F2{q\,v\,v2)  £  T*29?n-m.  Here,  as  well  as  in  the  rest 
of  this  chapter,  defining  the  exterior  force  on  TQ  is  equivalent  to  specifying  a  feedback 
control. 

A  general  method  to  determine  a  motion  of  the  system  in  the  above  setting  is 
to  solve  the  (2n  +  m )-dimensional  differential  equations  given  in  (2.2.33)  or  (2.2.34). 
However,  by  observing  the  properties  of  the  Lagrangian,  the  constraints  and  the  exterior 
force  given  in  (5.1.1)-(5.1.3),  one  can  show  that  the  problem  can  be  reduced  to  solving 
a  lower  dimensional  system  of  differential  equations  without  constraints.  Classically, 
a  system  as  defined  above  is  called  a  Chaplygin  system  provided  that  b(q\)  =  0  [32]. 
It  has  been  shown  that  for  such  systems,  the  dynamic  equations  can  be  reduced 
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to  unconstrained  equations  on  the  space  parametrized  by  q\  with  a  modified 
Lagrangian  and  a  modified  exterior  force.  For  the  proof  of  this  assertion  and  examples  of 
physical  systems,  see  [32].  From  a  geometric  point  of  view,  it  is  clear  that  the  conditions 
for  a  system  to  be  a  Chaplygin  system  is  that  the  Lagrangian,  the  exterior  force  and  the 
constraints  are  invariant  under  the  action  of  the  Abelian  group  ,  namely, 

$  :  Mn-m  xQ^Q 


(5.1.4) 


(2,(91.92))  i-»  (91,92+2). 

Because  of  this,  we  refer  to  a  classical  Chaplygin  system  as  an  Abelian  Chaplygin  system 
with  linear  constraints  and  the  system  given  by  (5.1.1)-(5.1.3)  as  an  Abelian  Chaplygin 
system  with  affine  constraints.  Next,  we  show  how  to  derive  reduced  dynamic  equations 
on  parametrized  by  (qi,  iq)  from  the  original  system  with  affine  constraints. 

Consider  an  Abelian  Chaplygin  system  with  affine  constraints  given  by  (5.1.1)- 
(5.1.3).  From  Lagrange- d’Alembert  principle  formulated  in  (2.2.33),  the  dynamics  is 
given  by 


dt 


DvL(q,  v )  -  DgL(q,  v)  •  u  =  F(q,  v )  •  u 


(5.1.5a) 


with  ( q,v )  =  ((qi,q2),(vi,v2))  6  TQ  satisfying 


f{q,v)  =  B(qi)vi  -  v2  +  b(q2 )  =  0  (5.1.5b) 


and  u  =  (tti,u 2)  €  TqQ  satisfying 


Dvf(q,  v)-u=  B(qi  )Ui  -  u2  =  0. 


(5.1.5c) 


Using  (5.1.5c)  and  (5.1.1),  Equation  (5.1.5a)  can  be  re-written  as 
(^ DVlL(q,v )  -  DqiL(q,v))- +  ^(Dv?L(q,  v))  •  B(qx  )ui 


where 


F((7i,ni)  •  mi, 


F(qi,vi)  =  Fi(qi,vi,v2)\V2  +  B(qi)*F2(qi,Vi,v2)\V2 


(5.1.6) 


(5.1.7) 


and  (’)l«2  —  (') Iti2=s(9j)ui +(>(gi)  •  Let 


L(7i,ui)  =  L(qi,vuB{qi)vi  +&(<7i)). 


(5.1.8) 
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Then,  it  is  easy  to  check  the  following  equalities 


DVlL(qi,v x,t?2)  •  ux  =  DVlL(qi,Vi)  •  ux  -  DV2L(qx,vx,  v2)|„2  •  (£(gX)«i) 
DgiL(gx,t>x,v2)  •  ux  =DgiL(gx,ux)  •  ux  -  DV22/(gx,nx,n2)|„2  •  ((DqiB(qx)ux)vx) 

-  DV2L(qx , ux, u2)|„2  -(Dqib(qx)ux), 

where  DqxB{qx)  :  -*  £(&"*,£”-”*)  and  Dqib(qx)  :  ->  &n~m  are  the  Frechet 

derivatives  of  B(qx)  and  6(gx),  respectively.  Substituting  from  the  above  two  equations 
in  (5.1.6),  one  gets 

(■^DVlL(qx,vx)-DqiL(qx,vx))  ■  ux 

=DV2L\V2  ■  [(DqiB(qx)vx) ux  -  (DqiB(qx)ux)vx] 

-  DV2L\V2  ■  Dqib(qx)ux  +  F(qx,vi)  ■  ux 


—  {Dv2L\v2,  dB(qx)(vx,ux)) 

-  (DV2L\V2,  Dqib(qx)ux)  +  F{qx,vx)  •  ux, 

where  dB(qx)(vx,  ux)  =  ( DqiB(qx)vx)ux  —  ( DqiB(qx)ux)vx  6  Kn_m  .  Writing  things  in 
terms  of  components,  we  have,  for  B(qx)  =  [B'i ,i  —  1  —  m;j  = 

vx  =  (v%,k  =  1,  •  •  -,m)  and  ux  =  ( u[,l  =  1  •  -,m), 

m  m  /apik  f)  nil 

(^x^or-EEf 

*=i  (=i v 

for  i  —  1,  ■ 


dq[  dqkv 


V\ L«i 


,n  —  m. 

We  summarize  the  above  result  in  the  following  theorem. 

Theorem  5.1.1:  If  a  curve  {q(t)  =  ( qx(t ),  g2(t)),  t  >  0}  in  Q  =  X  &n_m  is  a  motion 
of  the  constrained  Lagrangian  system  with  the  Lagrangian  L,  the  constraints  and  the 
exterior  force  given  in  (5. l.l)-(5. 1.3),  respectively,  then  the  curve  {qx(t),t  >  0}  is  the 
motion  of  the  unconstrained  Lagrangian  system  in  with  Lagrangian  L  given  by 
(5.1.8)  and  the  exterior  forces  given  by  (DV2L\V2,  dB(qx)(vx,  •)) ,  -{D  qxb{qx)y  DV2L\V2 
and  F(qx,vx)  as  in  (5.1.7),  i.e.,  it  satisfies  the  reduced  dynamics 

— DVlL(qx,vx) -ux  -  DqiL(qx,vx)-ux  =F(qx,vx)-ux  -  {DV2L\V2,  Dqib{qx)ux) 
dt  (5.1.9) 

+  (DV2L\V2,dB(qx)(vx,ux)) 


for  any  ux  £  9ir 


115 


Remarks  5.1.2: 


(1)  If  {qi(t),t  >  0}  is  known,  the  curve  {q2(t),t  >  0}  can  be  determined  uniquely 
from  constraint  equations  by  quadrature,  i.e., 

q2(t)=  [  S(?1(r))g1(r)  +  6(g1(r))dr. 

Jo 

It  is  obvious  that  the  curve  {(qi(t),  q2(t)),t  >  0}  satisfies  the  original  differential 
equation  (5.1.5).  Therefore,  the  condition  (5.1.9)  in  Theorem  5.1.1  is  also  necessary. 

(2)  The  results  of  Theorem  5.1.1  are  applicable  to  a  system  with  Lagrangian  being  not 
necessarily  of  the  form,  kinetic  energy  minus  potential  energy. 

(3)  If  the  configuration  space  Q  in  Theorem  5.1.1  is  replaced  by  B  x  G  where  B  is 
an  m-dimensional  manifold  and  G  is  an  (n  —  m) -dimensional  Abelian  symmetry 
group  of  the  system  (i.e.,  is  replaced  by  B  and  Kn_m  is  replaced  by  G),  then 
it  is  easy  to  see  that  Theorem  5.1.1  is  still  valid  and  Equation  (5.1.9)  is  just  the 
local  form  of  the  dynamics  on  B . 

(4)  An  important  observation  here  is  that,  as  in  the  case  of  a  Chaplygin  system  with 
linear  constraints,  when  the  system  satisfies  conditions  (5.1.1)-(5.1.3),  or  symmetry, 
one  can  also  simplify  the  problem  of  solving  constrained  dynamic  equations  given 
in  (5.1.5)  to  the  problem  of  solving  the  unconstrained  dynamic  equations  given  in 
(5.1.9)  of  lower  dimension.  As  we  shall  see  in  the  following  sections,  for  systems 
with  non- Abelian  symmetry,  such  a  simplification  is  also  possible,  but  in  two  steps. 

■ 

Technically,  the  above  derivation  looks  very  elementary  and  quite  the  same  as  the 
one  for  a  system  with  linear  constraints.  However,  by  applying  this  result  to  a  system 
with  a  constant  momentum  map  or  a  conservation  law,  we  will  discover  the  reduced 
dynamics  originally  derived  by  Routh  [3,38],  who  obtained  such  equations  by  defining  a 
new  function,  called  the  Routliian,  through  application  of  the  so-called  cyclic  coordinates. 
It  is  our  re-formulation  of  Routh ’s  problem  below  via  the  nonholonomic  constraints  that 
motivated  us  to  study  more  general  systems,  namely,  non- Abelian  Lagrangian  reduction, 
as  worked  out  in  the  later  sections. 
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Consider  a  simple  mechanical  system  with  (Abelian)  symmetry, 


0 Q  =  Un,K,V,G=ftn-m). 


(5.1.10) 


Here,  the  Riemannian  metric  is  written  as 


K(q)(v,w) 


vTM(qi)w  =  ( viT,V2T ) 


(  Mu(q i) 

V  ^12(91  )T 


fwi\ 

M22(qi)  J  \w2J’ 


where  v\,W\  6  and  v2,w2  S  5N1-”1 ,  MtJ(gi )  are  the  snbmatrices  of  the  symmetric 
matrix  M  of  suitable  dimensions,  and  the  potential  function  satisfies  V  =  V(q\).  The 
action  of  G  on  Q  is  given  by  (5.1.4).  Then,  the  Lagrangian  of  the  system  is 
1  T 

L(qi,vi,v2)  =-v  M(q1)v-V{ql) 

1  !  (5.1.11) 

=  2Wi T  M\\(q\)v\  +  M\2(q\)v2  +  -v2T  M22(q2)v2  —  ^(91). 


The  Lie  algebra  of  G  =  3£n_TO  is  Q  =  9£n-Tn .  It  is  easy  to  check  that,  if  £  €  G , 
the  infinitesimal  generator  of  the  action  $  in  (5.1.4)  with  respect  to  £  is  simply 
£<3(9)  =  (0,f)>  where  0  is  the  null  vector  in  Then,  by  (2.1.27),  the  momentum 
map  with  respect  to  the  tangent  action  of  $  is 


J(9,v)  =  Ml2(qx)vx  +  M22{q\)v2.  (5.1.12) 


Assume  that  the  exterior  force  or  control  leaves  /z  =  J(<jr,u)  invariant.  Since  is 
fixed,  the  problem  now  is  to  determine  the  dynamics  under  the  constraints  given  by  the 
constant  momentum  map.  Note  that  the  Lagrangian  given  in  (5.1.11)  is  G-invariant 
and  J(g,  v)  —  fi  can  be  re-arranged  as 


v2  =  ~M22{q1)M22{qx)v1  +  M22(qx)n 


=  B(qi)vi  +  6(91), 


(5.1.13) 


where  B(qx)  =  —  M22  (qi)Mf2(qx)  and  b(qx)  =  M22(qx)n,  which  is  also  G-invariant. 
Therefore,  we  have  an  Abelian  Chaplygin  system  with  affine  constraints.  From  the 
definition  of  L  in  (5.1.8),  after  a  simple  calculation,  we  have 

L(qi,n)  =^ViT{Mu  ~  M\2M22  MX2)v\  +  ^/zT Af^V  -  V(?i) 

=  ^vxtMvx  +  ^/iTM2^V  -  ^(91), 
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where  M  =  Mu  —  Mx2M22Mx2, 


Dv^L(qi,v1,v2)\V2  =  fi, 


and 


(DV2L\V2,  Dqib(qi)ui)  =  (n,  ( DqiM221{q1)u1)fi )  =  Dqi(pT M22l(qx)p)  ■  ux. 

From  the  above  calculations,  we  see  that  if  we  define  a  function,  referred  to  as  the 
reduced  Lagrangian ,  as 

L^(quVi)  =  ^viT M(qi)v\  -  (^fiT  M22x  (qx)fJ,  +  V(?i)),  (5.1.14) 

equation  (5.1.9)  specializes  to 

^D^L^quVx)  -  ■  u2  =  (/x,  dB(qx)(vx,  ux))  +  F(qx,  iq)  •  ux.  (5.1.15) 

Since  the  infinitesimal  generator  of  the  action  in  (5.1.4)  corresponding  to  £  6  Q 
is  £q(?)  =  (0,£)>  the  submatrix  ff(gi)  =  M22{qx)  is  the  locked  inertia  tensor  (cf. 
Subsection  4.1.2).  Then, 

V^~^I -\qx)i*  +  V{qx) 

is  known  as  the  amended  potential  (cf.  [41]).  In  addition,  from  a  geometric  point  of  view, 
we  claim  that  —dB(qx)  is,  in  fact,  the  curvature  form,  f2,  of  the  mechanical  connection 
on  the  principal  bundle  (5ftn,  5im,  K”-TO)  and  the  first  term  in  (5.1.15)  is  known  as  the 
/x-component  of  the  curvature  form  evaluated  at  (ux,v\),  i.e., 

vi)  =  =  (/*,  ~dB(qx)(ux,vx)}.  (5.1.16) 

Recall  that  the  connection  form  u(q)  of  the  mechanical  connection  on  Q  is  given  as 

(«(?)>«)  =  I(g)-1J(tf,t>), 

where  I  is  locked  inertia  tensor  and  J  is  the  momentum  map.  From  the  above  discussion 
and  (5.1.12)  we  have 

(u(q),v)  =  M22  (71). Mx2(qx)v\  +  v2. 
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This  implies  that  the  connection  form  is 

u(qi)  =  M22  (q1)M[2(q1)dq1  +  dq2 
=  -B{q\)dq1  +  dq2. 

Since  the  symmetry  group  here  is  Abelian,  by  the  definition  of  curvature  form  (cf. 
Subsection  4.1.1),  the  above  equation  justifies  our  claim. 

In  summary,  we  state  the  result  as  a  theorem. 

Theorem  5.1.3:  If  a  curve  {q(t)  =  (qi(t),q2(t)),t  >  0}  in  Q  —  -ft"1  X  5in_m  is  a 
motion  of  the  simple  mechanical  system  with  symmetry  given  in  (5.1.10)  together  with 
exterior  force  (5.1.3)  which  leaves  the  momentum  map  given  in  (5.1.12)  invariant,  i.e., 
J (q,q)  =  fi,  then  the  curve  {q\(t),t  >  0}  in  !ftm  satisfies  Euler- Lagrange  equation 

•ni  =  ftM(?i)(«i,Vi)  +  F(qi,vi)  ■  ui,  (5.1.17) 

where  is  given  in  (5.1.16)  and  F(<?i,ui)  given  in  (5.1.7)  for  B  therein  given  in 
(5.1.13).  ■ 

Remarks  5.1.4: 

(1)  If  Equation  (5.1.15)  or  (5.1.17)  is  written  in  components,  one  gets  the  reduced 
dynamic  equations  due  to  Routh  [38]. 

(2)  Theorem  5.1.3  can  be  extended  to  a  system  on  any  n -dimensional  smooth  manifold 

Q ,  and  (n  —  m) -dimensional  Abelian  symmetry  group  G.  With  this  extension, 
Equation  (5.1.17)  gives  the  dynamics  or  special  vector  fields  (cf.  Chapter  II),  in 
local  coordinates,  on  T(Q/G).  Then,  the  Theorem  (5.1.3)  is  known  as  Lagrangian 
reduction  for  the  system  admitting  Abelian  symmetry,  which  is  also  shown  in  [3].  A 
natural  question  that  follows  is  to  work  out  the  reduction  theorem  for  the  systems 
with  non- Abelian  symmetry.  We  will  answer  this  question  in  Section  4.  | 

5.2  Dynamics  on  Horizontal  Distribution 

Consider  a  simple  mechanical  system  with  symmetry  given  by  a  four-tuple 

(Q,ff,V,G),  (5.2.1) 


^DVlL^.(q\,vi)  Dg1L 
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where  Q  is  the  n -dimensional  configuration  space;  G  is  a  Lie  group  of  dimension  p 
acting  on  Q  on  the  left  freely  and  properly.  This  action  is  denoted  by  $  given  in  (2.1.1); 
K  is  a  Riemannian  metric  and  G  acts  on  Q  by  isometries;  V  is  a  G -invariant  potential 
function  (cf.  Subsection  2.1.4).  The  Lagrangian  of  this  system  is  given  by 

L(q,v,)=l-K(q)(v„v,)-V(q).  (5.2.2) 

In  addition,  we  let  the  configuration  space  Q  be  a  principal  G'  -bundle: 

p  =  (Q,  5,  *,£?'),  (5.2.3) 

where  G'  is  a  p' -dimensional  closed  subgroup  of  G,  B  =  QjG'  is  the  m(—  n  —  p') 
dimensional  base  space  and  7r  :  Q  — ►  B  is  the  bundle  projection.  From  Section  4.1, 
we  know  that  on  this  bundle  one  can  choose  a  connection,  which  defines  a  horizontal 
distribution  on  TQ  (cf.  Definition  4.1.3),  and  associated  to  such  a  connection,  there  is 
a  unique  connection  form,  u  £  wl(Q;Q').  We  now  consider  a  class  of  constraints  which 
relate  to  the  connection  form  as  follows. 

Constraint  Hypothesis  5.2.1:  We  assume  that  the  motion  of  the  system,  (<?(•),  vq(-)) , 
is  constrained  to  a  2n  —  p'  dimensional  subspace  of  TQ  defined  by 

S  =  {  (?,  vq)  E  TQ  |  u(q)(vq)  =  £(?)  },  (5.2.4) 

where  the  mapping  £  :  Q  — *  G'  is  smooth  and  also  G'-equivariant,  i.e.,  £(<7  •  q)  = 
Adg£(q),Vg  £  G' .  I 

Remark  5.2.2:  Since  Q'  is  a  p' -dimensional  vector  space  and  u(q)(vq)  is  linear  in  vq , 
the  above  assumption  can  be  viewed  as  giving  p'  affine  constraints  on  TQ .  In  addition, 
if  £(9)  =  0,  the  subspace  S  is  just  the  horizontal  distribution.  | 

With  the  above  setting,  according  to  equations  (2.2.33)  derived  from  Lagrange- 
d’Alembert  principle,  the  dynamics  of  the  system  is  given  by  the  following  equations, 

d 

—  Do  L(q,  vq)  ■  uq  -  DxL(q,vq)  ■  uq  =  F  ■  uq  (5.2.5a) 

for  ( q,vq )  satisfying 

«(?)(«,)  =  m  (5.2.5b) 
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and  uq  belonging  to  the  horizontal  snbspace  at  q,  H,  C  TqQ ,  i.e., 

w(?)(«g)  =  o.  (5.2.5c) 

Here,  F  is  an  exterior  force  satisfying  the  following  hypothesis: 

Exterior  Force  Hypothesis  5.2.3:  We  assume  that  an  exterior  force  is  a  mapping 

F:TQ  ->  T*Q 


such  that  \/(q,  vq) , 

F($9(<l),Tq$a-vq)  =  T;$g-F(q,vq)  Vg€G,  (5.2.6) 

i.e.,  it  is  G-equivariant.  g 

Let  q(-)  =  {q(t),t  >  0}  be  the  solution  of  (5.2.5)  with  initial  condition  q(0)  =  qo, 
and  x(-)  =  { x(t )  =  7r (q(t)),t  >  0}  the  projection  of  q(-)  on  B.  Our  final  goal  is  to 
explicitly  formulate  the  projected  dynamics  on  TB .  Since  we  assume  that  the  structure 
group  G'  is  non- Abelian  in  general,  such  a  formulation  is  no  longer  a  straightforward 
calculation  as  we  have  done  in  the  previous  section.  Our  strategy  now  is  to  determine 
the  unconstrained  dynamics  on  the  horizontal  subspace  for  the  given  connection,  and 
then,  project  it  to  the  base  space  B . 

Given  a  connection  on  Q,  let  r(-)  =  {r(t),t  >  0}  be  the  horizontal  lift  of  x(-) 
to  Q  for  given  r(0)  =  ro,7r(ro)  =  x(0)  =  7r(go)-  From  Theorem  4.1.7,  we  know  that 
this  lifted  curve  is  unique.  The  question  we  will  address  is  that  if  the  left-hand-side 
of  Euler-Lagrange  equation  (5.2.5a)  is  restricted  to  the  horizontal  curve  r(-),  what  the 
dynamic  equation  should  be,  i.e.,  we  will  answer  the  following  question: 

^ D2L(r ,  vr)  ■  ur  -  DiL(r,  vr)  ■  ur  =?,  (5.2.7) 

where  ur  6  Hr  (the  horizontal  subspace  at  r )  and  vr  =  r(t) , 

Remark  5.2.4:  Equivalently,  one  can  ask  what  is  the  form  of  equation  (5.2.5a)  if  vq 
is  split  into  horizontal  and  vertical  components.  g 

From  the  uniqueness  of  horizontal  lift,  we  know  that  for  a  given  q(-)  in  Q  and  the 
horizontal  lift  r(-),  there  exists  a  unique  curve  g(-)  =  {g(t),t  >  0}  Q  G'  such  that 

q(t)  =  $(g{t),  r(t ))  =  $fl(r)  =  g  ■  r. 
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Then, 


vq(t)  =  q(t)  =Tr^g-vr  + —  $(g(r)g  1(t),g(t)  ■  r(t)) 

aT  T  =t 

=  Tr$g  •  vr  +  [TgRg-ig(t)]Q(q) 

=  Tr$g  •  vr  +  [i l(t)]Q(q),  (5.2.8) 

where  r)(t)  =  TgRg-ig(t)  £  Q'  for  Q'  the  Lie  algebra  of  G' .  Equation  (5.2.8)  presents 
the  splitting  of  a  tangent  vector  on  Q  according  to  a  choice  of  connection.  Evaluate 
ui(q)  on  both  side  of  (5.2.8).  Since  vr  is  horizontal,  by  Constraint  Hypothesis  5.2.1  we 
have 

V(t)  =  u(q)(vq)  =  £{q)-  (5.2.9) 

Therefore, 

vq{t)  =  Tr$a-vr  +  [Z(q)]Q(q).  (5.2.10) 

Since  the  connection  form  has  equivariant  property,  we  have 

VT  =  Tq$g-1  ■  vq  -  Tq$g-x  •  [{(«)]g(?) 

=  Tq ■  Vq  -  [A dg-i^q)\Q{g~l  ■  q) 

=  r)V1^rWr)'  (5.2.11) 

For  simplicity,  we  sometimes  abbreviate  [£(-)]q(')  by  £q(-)-  But  when  it  is  necessary, 
e.g.  operating  on  derivatives  or  displaying  the  final  results,  we  will  use  the  complete 
notation. 

In  the  following  derivations,  we  will  frequently  use  Equations  (5.2.6),  (5.2.10), 
(5.2.11),  the  G-invariant  property  of  the  Riemannian  metric  and  potential  energy,  and 
the  chain  rule  in  differentiation.  The  derivations  will  be  carried  out  in  local  coordinates, 
but  in  an  intrinsic  way. 

Substituting  vr  from  (5.2.11)  into  the  first  term  of  left-hand-side  of  (5.2.7),  we 

have 

~D2L(r,vr )  -ur  =  ^ K(r)(vr,ur ) 

=  ^A'(r)(Tg#a-i  •  vq,uT)  -  ^K(r)(Tq$g-i  -  [^(<?)]Q(g),  uT) 

=  A  -  B.  (5.2.12) 
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Here,  A  can  be  further  expanded  as 

d 
dt 
d_ 
dr 


/?l  (9(t))(n9(t), Tr$3(T)  •  ur) 


r=t 


A  =  —  K(g  •  r)(vq,Tr$g  •  ur ) 

K(q(T))(vg(T),Tr$ g(t)  ■Ur)+^ 

(5.2.13) 

Let  uq  =  Tr$g  ■  ur  which  certainly  satisfies  condition  (5.2.5c).  Then,  from  (5.2.5a),  we 


=  A\  +  Ai . 


have 


Ai  ~-{DqK{q)  ■  uq)(vq,vq)  -  DqV(q )  •  uq  +  F(q,  vq )  •  uq 

=  ^(DqK(g  •  r)  ■  Tr$ gur)(Tr$ g(vr  +  £g(r)),Tr$3(ur  +  £g(r))) 


d_ 

de 


(DqV(g  ■  r )  •  Tr$5ur)  +  F(<?  •  r,  T,S>r  +  [£(r)]g(r)))  •  Tr$3ur 
[i/i'(<7  •  (r  +  cur))(Tr$3(nr  +  fg(r)),Tr$3(ur  +  fg(r))) 


e=0 


-  V(g  ■  (r  +  ctir))]  +  T;$gF(r,  vr  +  [£(r)]g(r))  •  Tr4> 


gUr 


=  -(DrK(r)  ■  ur)(vr,vr)  -  DrV(r )  •  +  F(r,nr  +  [^(r)]g(r))  •  ur 

+  \{DrK{r )  •  ttr)(£g(r),  £g(r))  +  (Z?r/if(r)  •  ur )(£g(r),  t>r).  (5.2.14) 

Note  that,  in  the  above  derivation,  we  used  the  Exterior  Force  Hypothesis  5.2.3.  The 
first  two  terms  in  (5.2.14)  is  in  fact  the  second  term  in  (5.2.6),  that  is,  DiL(r,  vr)  ■  ur. 
We  now  consider  Ai  in  (5.2.13).  Note  that 

${g{t),r+  eur) 


d  d  d 

— Tr$3(g  ■  ur  =  — — 


dt  de 
d 


de 

d_ 

de 


e=0 

d 


e=0dT 


$(g{r)g  1{t),g(t)-(r  +  eur)) 


r=t 


[Z(9  ■  r)]Q(g  ■  ( r  +  eur )) 


6  =  0 


[£(r)]Q(r  +  «*r) 


—  71  $  .  — 

*l«=0 
=  Tr$s  -(D^Q(r)  -ur). 


(5.2.15) 


Therefore, 

«^2  =  K{q)(vqi  '  ur ) 

=  /i(5T  •  r)(Tr$3  •  (ur  +  fg(r)),Tr$g  •  (Dr^Q(r)  ■  ur )) 

=  K (r)(vr,  DrZQ(r)  ■  uT)  +  K(r)(£Q(r),  Dr£Q(r)  ■  ur).  (5.2.16) 
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Finally,  B  in  (5.2.12)  can  be  re-arranged  as  follows: 


=(DqK(q)  ■  vq)(SQ(q),  uq)  +  K(q)([Dg£(q)  ■  vq]Q{q),  uq) 
+  K{q){D^Q{q)  ■  vq,  uq)  +  K(q)(€Q(q),  jTt§  g(t)ur) 
^Bl+B2  +  B3  +  B4t 


where 


Bi  —(DqK(g  •  r)  ■  Tr$g(vr  +  ^(r)))(Tr$^Q(r), Tr$gur ) 
=  7-  K{9  ■  (r  +  e(vr  +  ZQ(r))))(Tr$gZQ(r),Tr$gur) 

de  e=0 

=  X  ff(r  +  €(t>r  +  fg(r)))(6}(r),ttr) 

ae  e=0 

=(DrK(r )  ■  (vr  +  fa(r)))((Q(r),Ur), 


32  =Km 


f(?  +  evq)\Q(q),Tr$gUr) 


6  =  0 


=K(9  ■  r)([-r 

“€U=o 

=K{g-r)(Tr$g[d 


Z{g  ■  (r  +  e(vr  +  ZQ(r)))]Q(g  ■  r),Tr$gur ) 
£(r  +  e{vr  +  fg(r)))]g(r),  Tr$gur) 


de 


6=0 


=K(r)([DrZ{r)  ■  (vr  +  fg(r))]g(r),ur), 


83  =KU)(- 
—K{g  ■  r)( 


[£(«)]<}(? +  «>g),*«) 


6  =  0 

d 

de 


[£(5  ’  r)h(g  ■  ( r  +  e{vr  +  fg(r)))),  Tr$gur ) 


6=0 


=K{r)(Dr£Q(r)  ■  (vr  +  fg(r)),  ur), 


B 4  =/v'(<7  ■  r)(Tr$^Q(r),Tr$aF>r^g(r)  -  «r) 
=K(r)(Dr£Q(r)  ■  ur,£Q(r)). 


From  (5.2.12)-(5.2.21),  we  get  the  answer  to  the  question  in  (5.2.7)  as  follows: 


(5.2.17) 


(5.2.18) 


(5.2.19) 


(5.2.20) 


(5.2.21) 
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37 D2L(r,vr )  •  «r  -  DiL(r,vr)  ■  uT  =F(r,vr  +  [f(r)te(r))  •  ur 
at 

+  7^(DTK(r)  •  ur)(£Q(r),£Q(r)) 
+  (DrK(r)-Ur)(tQ(r),vr) 


+  K(r){vT ,  Dr£Q(r)  ■  ur) 


-  (. DrK(r )  ■  (vr  +  £g(r)))(£g(r),  ur) 

~  K(r)([D^(r)  •  (vr  +  £g(r))]g(r),  uT) 

-  K(r)(DrtQ{r)  ■  (vr  +  fg(r)),  tir).  (5.2.22) 


Next,  we  will  make  (5.2.22)  more  compact  by  defining  new  functions.  Let 


V€(r)  =  V(r)+  ^ K(r)([Z(r)]Q(r),[Z(r)]Q(r )) 

=  F(r)  +  i(I(r)e(r),^(r)), 

where  the  mapping  I (r)  :  G  -*  G*  has  been  defined  in  (4.1.9).  Then 
DrV*(r)  ■  ur  —DrV(r)  ■  ur  +  ^( DrK(r )  ■  ur)(£Q(r),£Q(r)) 

+  K(r)([Dr£(r)  •  ur]Q(r),tQ(r))  +  K(r)(Dr^Q(r)  ■  ttr,£g(r)). 
Define  a  new  Lagrangian  on  horizontal  space  Hr , 

L^(r,vr)  =  ~K{r){vr,  vr)  -  V*(r) 


(5.2.23) 


(5.2.24) 


and  a  function  on  TQ : 


E{r){ur,vr)  =  K(r)(vr,[DT£{r)  •  wr]g(r)). 


Then  (5.2.22)  becomes, 

~D2L^(r,vr)  ■  uT  -  D1Li(r,vr)  ■  ur  =F(r,vr  +  [£(r)]g(r))  •  ur 
at 

+  (DrK(r)-ur)(tQ{r),vr  +  tQ(r)) 

+  K(r)(Dr£Q(r)  ■  ur,  vr  +  £g(r)) 

+  K (r)([Dr£(r)  •  ur)Q{r),  vr  +  £q(r)) 

-  ( DrK(r )  ■  (vr  +  £g(r)))(fg(r),tir) 

-  K(r)(Dr{Q(r)  ■  (vr  +  £g(r)),ttP) 

-  K{r){[Dr£{r)  •  (vr  +  fg(r))]g(r),tir) 

-  E(r)(ur,vr) 
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=  F(r,  vr  +  [f(r)]g(r))  •  ur  +  {Dr[Kl ?(r)([£(r)]g(r))]  •  ur)  ■  ( vr  +  fg(r)) 

-  (Fr[A4(r)([^(r)]g(r))]  •  ( vr  +  fg(r)))  ■  ur  -  ~(r)(ur,  vr) 

=F{r ,  vr  +  [£(r)]g(r))  •  uT  +  dwc(r)(ur,  vT  +  fg(r))  -  E!(r)(ur,  vr ), 

where  u^(r)  =  -^b(r)([^(r)]g(r))  is  a  real-valued  one-form  on  Q  and  K b  :  TQ  -*  T*Q 
is  the  usual  Legendre  transform.  In  the  above  derivation,  we  used  the  following  fact  in 
local  coordinates, 

da(r)(X ,  y)  =  (£>a(r)  -X)-Y-  ( Da(r )  ■  Y)  ■  X  Vr  €  Q 


for  X,Y  6  3J(Q)  and  any  one-form  a  on  Q . 

Up  to  now,  we  have  proved  the  following  theorem. 

Theorem  5.2.5:  If  q(-)  is  a  solution  of  the  constrained  dynamics  (5.2.5),  then,  for 
given  choice  of  connection  on  principal  G'-bundle  (5.2.3),  any  horizontal  lift,  r(-),  of 
<7(-)’s  projection  satisfies  the  unconstrained  equation 

^-D2A4(r,ur)  •  ur  -  DiL*(r,vT)  ■  uT  =F(r,vr  +  [f(r)]g(r))  •  ur 
at 

+  du>^(r)(ur,  vr )  +  du>z(r)(ur,  [f  (r)]g(r)) 

—  E(r)(ur,  vr) 

(5.2.25) 


for  any  uT  G  Hr  C  TrQ ,  where 


i€(r,ur)  =  ^K(r)(vr,vr)  -  V^(r) 

V€(r)  =  V(r )  +  ~A'(r)([^(r)]g(r),  [£(r)]g(r)) 

=  V(r)  +  ±(I(rX(r),{(r)), 
cjt(r)  =  A't>(r)([^(r)]g(r)), 

E (r)(ur,vr)  =  AT(r)(ur,[J5r^(r)  •  ur]g(r)).  I 


Remarks  5.2.6: 

(1)  From  the  derivation  of  du^ ,  the  force 

Fgyro(r,vr )  =  dw$(r)(  •  ,  vr)  (5.2.26) 

has  the  property  of  a  gyroscopic  force. 
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(2)  One  can  show  that  Equation  (5.2.22)  can  be  written  in  an  alternative  form  without 
resort  to  the  function  V * ,  that  is, 

^ D2L(r,vr )  ■  ur  -  DiL(r,vr)-ur  =F(r,vr  +  [^(r)]g(r))  •  ur 

+  ^r(I(r)((r),((r))^r 
+  duf.(r)(ur,  t;r)  -  S(r)(ttr,  vr  +  &j(r)). 

We  are  not  going  to  use  this  form  since,  when  we  consider  Lagrangian  reduction 
later,  Equation  (5.2.25)  will  give  us  a  straightforward  answer. 

(3)  Once  a  horizontal  curve  is  determined  by  solving  the  unconstrained  equation 
(5.2.25)  for  an  initial  condition  r(0),  the  solution  for  the  original  constrained 
equations  (5.2.5)  can  be  determined  by  first  solving  the  differential  equation 

g(t)  =  g(t)  ■  £(r{t)) 

for  </(0)  satisfying  <?(0)  =  <7(0)  •  r(0) ,  and  then  setting 

?(*)  =  9{t)-r(t). 

(4)  The  equilibria  of  the  system  are  determined  by  the  solution  of  algebraic  equations 

DiV*(r)  -  du>z(r)(  ■  ,  [f(r)]Q(r))  =  0.  I 


5.3  Non-Abelian  Chaplygin  Systems 

In  this  section,  we  show  how  to  drop  the  unconstrained  dynamics  on  horizontal 
bundle  given  in  (5.2.25)  down  to  the  base  space  for  a  given  principal  fiber  bundle.  To 
get  explicit  expressions,  we  will  consider  the  formulation  on  product  bundles.  Since  a 
principal  fiber  bundle  is  locally  trivial,  the  following  results  will  be  true  locally  in  general. 
Also,  for  simplicity,  we  will  assume  the  symmetry  group  in  (5.2.1)  is  the  same  as  the 
structure  group  of  the  principal  bundle  (5.2.3)  (i.e.,  G'  =  G). 

Let  Q  —  B  X  G  be  the  configuration  space  parametrized  by  q  —  (x,h)  for  x  G  B 
and  h  G  G .  Then,  the  tangent  space  is  TqQ  =  TXB  x  ThG .  The  tangent  vector  at  any 
point  q  in  Q  is  given  by 

=  [  1  h  '  C  }(x,h)  (5.3.1) 
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for  some  £  G  Q ,  where  h  •  £  =  TeLhQ.  Consider  the  principal  bundle  ( B  x  G,B,G), 
where  G  acts  on  Q  =  B  X  G  on  the  left  as  shown  in  Subsection  4.1.2,  and  a  connection 
given  by  connection  form  tj  G  vj  1(Q;  Q).  One  can  show  that,  on  this  bundle  there  exists 
an  unique  (/-valued  one-form,  0,  on  B  such  that,  at  each  point  q  =  (x,h)  in  Q , 

u(q)  ■  vq  -  Adh(C  +  u(x)  •  vx),  (5.3.2) 

where  vq  is  given  in  (5.3.1)  [16].  We  refer  to  w  as  the  pull-down  connection  form, 
which  is  also  the  usual  notion  of  local  connection  form  if  a  non-trivial  principal  bundle  is 
considered.  The  proof  of  (5.3.2)  can  be  verified  by  letting  vq  be  horizontal  and  comparing 
(5.3.2)  with  (4.1.6).  With  the  above  connection,  the  tangent  vector  vq  on  Q  at  q  =  (x,  h) 
has  its  horizontal  and  vertical  splitting, 

Vq  =  Ver(vq )  +  Hor(vq), 

where 

Ver(vq)  =  [ Adh(Q  +  u{x)  ■  vx))]Q(q)  =  [  0  ,  /*•(£  +  u(x)  •«*)].  (5.3.3) 

and 

Hor(vq)  =  vq-  Ver(vq )  =  [vx,  -h  •  (u(x)  ■  vx)].  (5.3.4) 

Indeed,  (5.3.3)  can  be  carried  out  directly  as  follows, 

Ver(vq)  =[Adh((  +  v(x)  •  vx)\Q(q) 

$(exp{eAdh((  +  w(x)  •  u^)},  (®,  h )) 

€=0 

[x,  exp{eAdh((  +  u(x)  ■  ux)}  ■  h\ 

6  =  0 

[x,  h  ■  exp{e((  +  u(x)  ■  ur)}] 

e=0 

=[0,fc-(£  +  w(x)  •«,)]. 

Now  we  express  each  object  in  Theorem  5.2.5  based  on  the  above  setting.  First, 
we  introduce  the  induced  Riemannian  metric  and  potential  function  on  base  space  B  in 
accordance  with  the  restriction  of  Riemannian  metric  and  potential  function  of  a  simple 
mechanical  system  with  symmetry  on  the  horizontal  subspace.  Let  Hr  be  the  horizontal 
subspace  of  TrQ  at  r  =  (x,h)  with  respect  to  the  above  connection.  If 

vr  =  [vx,  —h  ■  (w(x)  •  vx)]  G  Hr  and  wr  =  [wx,  -h  ■  (uj(x)  ■  wx )]  G  Hr, 


d_ 

de 

d_ 

de 

d_ 

de 
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then 


K(r)(vr,wr )  =K(x,h)([vx,  -h  ■  (u>( x)  ■  vx)],[wx,-h  ■  (u(x)  ■  tu*)]) 

=K(x,  e)([vx,  -u(x)  ■  vx],  [wx,  -u(x)  ■  wx ]) 

=  K(x)(vx,wx),  (5.3.5) 

where  e  is  the  identity  element  in  G,  and  the  G -invariance  of  K  has  been  used.  Also, 
since  the  potential  function  on  Q  is  G -invariant, 

V{r)  =  V(x,  h)  =  V(x,  e)  =  V(x).  (5.3.6) 

The  constraint  in  Constraint  Hypothesis  5.2.1  based  on  the  connection  given  in 
(5.3.2)  now  has  the  following  expression 

cj(z,  h)  ■  V(x<h)  =  Adh{ C  +  u(x)  ■  vx)  =  £(x,  h )  =  Adh{( x,  e) 

or  equivalently, 

£(*)  =  f(«»e)  =  C +  £(*)•  vx.  (5.3.7) 

Then,  for  any  q  =  (x,h)  £  Q , 

K(9)]q(?)  =  [Adh^(x)]Q(q)  =  [0,  h  •  £(*)],  (5.3.8) 

and  then,  the  second  term  of  V *  in  (5.2.23)  at  q  becomes 

\  K(q)mqMq),  K(?)]q(«))  K{x,  h)([ 0,  h  ■  £(*)],  [0,  h  ■  «*)]) 

K (x,e)([0,f(x)],[0, £(«)]) 

=  i(I(x)e(x)J(x)), 

where  I(a:)  =  l(x,e). 

Using  the  above  newly  defined  objects  on  base  space  B ,  the  function  L £  in 
Theorem  5.2.4  can  be  expressed  as, 

L*(r,  tv)  =  \  K(x)(vx,  vx)  -  (U(z)  +  I  <I(*)£(*),  £(*))) 

2_  2  (5.3.9) 

=  L*(x,vx), 

for  any  point  r  =  (x,  h)  £  Q  and  vr  =  [nx,  -h  ■  (u(x)  •  vx)}  G  Hr  C  TrQ  . 
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We  are  now  ready  to  drop  the  dynamics  given  in  (5.2.25)  down  to  the  base  space  B . 
Let  q(-)  =  (x(-),/i(-))  be  the  solution  of  (5.2.5)  with  initial  condition  g(0)  =  ( xQ,ho )  for 
xo  £  B  and  h0  £  G ,  where  x(-)  is  naturally  the  projection  of  q(-) .  Let  r(-)  =  (x(-),g(-)) 
be  the  horizontal  lift  of  x(-)  with  initial  condition  r(0)  =  (x(0),g(0)),  where  g(-)  is 
determined  by  solving  differential  equation 

g(t)  =  - g{t )  •  (w(x(i))  •  vx{t ))  (5.3.10) 

for  initial  condition  g(0).  Then  from  (5.3.4), 

vr(t)  =  [ur(t),  -g(t)  ■  (u(x(t))  ■  i7x(i))].  (5.3.11) 

Remark  5.3.1:  From  connection  theory,  we  know  that,  given  a  connection  on  a 

principal  fiber  bundle  p  =  there  exists  an  unique  horizontal  lift  mapping 

at  each  point  q  in  Q ,  denoted  by 

■ft(?)  :  T„(q)  TqQ. 

For  trivial  bundle  case,  this  mapping  has  been  given  explicitly  in  (5.3.4),  i.e., 

vr  =  H(x,  g)  ■  vx  =  [vx(t),  —g{t)  ■  (u>{x(t))  •  vr(i))]. 

We  will  use  this  notation  in  the  next  section.  ■ 

Let  ur  be  any  tangent  vector  at  r(t)  in  Hr,  which  can  be  represented  by 

ur  =  [%,  ~g(t)  •  (w(x(f))  •  ux)] 

for  some  ux  £  TXB .  Now,  the  first  term  on  the  left-hand-side  of  (5.2.25)  is 

d  d 

—  D2Li(r,vr)uT  =  —K(r(t))(vT(t),ur) 

=  7-  K(x(t),  g(T))([vx(T),  —g{r )  •  (w(x(r))ur(r))],  [ux,  -g{t)  ■  (u(x(t))ux)}) 

dT  r=t 

=  jtK Xx(t),g(t)){[vx(t),  -g(t)  ■  (u(x(t))vx(t))l  [ux,  —g(t)  ■  (£(x(<))OD 

-  K(x,  g)([vx,  -g  ■  (w(x)t7r)],  -g(t)  •  ( Z(x(t))ux )]) 

=  ^-K(x(t),  e)([ux(t),  -u(x(t))vx(t)],  [ux,  -u(x(t))ux]) 
at 

-  K(x, g)([vx,  -g  ■  (w(x)ur)],  [0,  g  ■  (u(x)vx)(u(x)ux)  -  g  ■  ((Dxu(x)vx)ux)]) 

d  _ 

=  A  (a:)(var(i),  %) 

-  K(x,e)([vx,-ui(x)vx\,[ 0,(u(x)vx)(u>(x)ux)  -  (Dxu{x)vx)ux]).  (5.3.12) 
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Remark  5.3.2:  In  the  above  derivation  we  used  the  following  convention, 

g-eem-eXr1*,  ¥771,772  €  G,  Vg£G. 


A  d 

griir)2  = 


de 


e=0 


d 

dX 


\=o 


The  second  term  in  the  left-hand-side  of  (5.2.25)  is 


DiL^(r,  vr )  ■  ur  =-  ( DrK{r )  •  ur)(vr,vT)  -  DrV(r )  •  uT 

~  \  Dr(K(r)([Z(r)]Q(r),  fc(r)]<j(r)))  •  uT 


where 


(DrK(r)-ur)(vr,vr)  =  — 


K(r  +  eur)(vr,vr) 


e=0 


d_ 

de 

d_ 

de 


(5.3.13) 


K(x  +  eux,ge 

) 

K(x  +  eux,e)([vx,  (2(x)vx)],  [vx,  (S5(*)t>x)]) 

) 

=(DxK(x)ux)(vx ,  vx)  -  2 K(x,  e)([vx,  -u(x)vx\,  [0,  -(DJ2(x)ux)vx]) 

+  2 1((x,  e)([vx,  -u(x)vx],  [0,  -(c^x^Xa^x)^)]),  (5.3.14) 


6=0 


6=0 


and 


DrV(r)  •  u 


_d_ 
r  ~de 
d 
de 
d_ 
de 


e=0 


V(r  +  eur) 

) 

V(x  +  eux,ge~eZ^) 


e=0 


V(x  +  eux,e) 

u=o 
=  D XV (x)  *  ux 


Dr(K(r){[Z(r)}Q{r),  K(r)]g(r)))  •  ur 
d 


de 

d 

de 

d_ 

de 

d_ 

de 


^(^)(^(r6)]g(r£),[^(re)]g(re))|  r€=r+eur 


6=0 


(5.3.15) 


K(x+eux,  ge~^u*  )([0,  ge~^u*  Z(x  +  eux)},[0,  ge~^u* ((x- +eux)\) 


6=0 


K (x  +  eux,  e)([0,  £(x  +  eux)],  [0,  f(x  +  eux)]) 


6  =  0 


(I(x  +  enr)^(x  +  eux),(;(x  +  eux)) 


6  =  0 


=((iyr(x)  •  «a;)f(x),  £(x))  +  2{I(x)£(x),  (Dx£(x))  •  ux). 


(5.3.16) 
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Therefore,  from  (5.3.12)-(5.3.16),  the  left-hand-side  of  (5.2.25)  is 

^-D2Z^(r,  vT)  ■  ur  -  DiL*(r,  vr)  ■  ur 
at 

=  i-K(x(t))(vx(t),ux)  -  l  {DxK{x)ux){ vx,vx)  +  DxV{x)  ■  ux 
at  l 

+  \  ((DxI(x)-u(xMx),ax))  +  <I(*)|(*)t(i)*f(*))  •  ux) 

-  I<(x,e){[vx,  -u(x)vx\,  [0,  (DJj{x)ux)vx  -  ( Dxu>{x)vx)ux 

+  (w(x)t>.,.)(2(®)u«)  -  (w(®)tt*)(2(® )«,)]) 

=  4-jD2Z^(x,  vx)  ■  ux  -  D\L(‘{x,vx)  ■  ux 
at 

-  K(x,e)([vx,  -w(x)w2.],[0,n(x)(tta:,ur)]) 

-~D2Li(x,vx)  ■  ux  -  DiLi(x,vx)  • ux  -  r(x)(va;,'ua;,na;),  (5.3.17) 

where 

tt(x)(ux,  vx)  =  {Dxu(x)ux)  ■  vx  -  (Dxu(x)vx)  ■  ux 

+  (u(x)  ■  vx){u{x)  ■  ux)  -  (w(x)  •  ux)(u{x)  ■  vx) 

=du(x)(ux,  Vx)  [  cj(x)  •  ux,  w(x)  ’ViJ  (5.3.18) 

and 

r(x)(va.,  ux,  vx)  =  K(x,e)([vx,-u(x)  ■  vx],[0,fl(x)(ux,vx)}).  (5.3.19) 

Among  the  newly  defined  objects  above,  r(x)(va:,,ur,t;x)  is  a  (3,  0)-tensor  on  B ,  which 
is  skew-symmetric  in  last  two  vectors.  And,  ft  is  a  5-valued  two-form  on  B,  which 
plays  the  role  of  local  curvature  form.  Indeed,  we  have  following  result. 

Proposition  5.3.3:  Let  r  =  (x,g)  be  any  point  in  Q  for  any  point  x  6  B  and  g  €  G, 
and 

uf  =  («*,  -g  ■  (2(5)  ■  «*))  and  vr  =  («*,  -g  ■  (2(5)  •  t>*)) 

be  horizontal  tangent  vectors  with  respect  to  the  connection  given  in  (5.3.2).  Let  ft  be 
the  corresponding  curvature  form  of  the  connection.  Then  we  have 

[ft(f)(ti?,  vf )]Q(f)  =  (0 ,g-  &(*)(“*» »*))•  (5.3.20) 

Proof:  Recall  that,  if  U  and  V  are  horizontal  vector  fields  on  Q  , 

mq){U(q),V(q))}Q(q)  =  -Ver([  U,  V  ](<?))•  (5.3.21) 
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We  start  from  computing  [U,V],  Let  u(x)  and  u(x)  be  vector  fields  on  B  extended 
respectively  from  uf  and  vs,  i.e.,  u(x)  =  ux  and  u(x)  =  vx.  Then  we  define  two 
horizontal  vector  fields  on  Q  to  be,  for  any  r  =  (x,g)  £  Q , 

U(r)  =  [u(x),-g-{u(x)-u(x))]  and  V(r)  =  [u(x),  -g  ■  (£(x)  •  u(x))].  (5.3.22) 

Let  4>u{t,r)  and  cf>v(t,r)  be  the  tangent  curves  of  U  and  h  at  r,  respectively,  i.e., 
<j>u(t,r)  =  (x  +  tu(x),g-  and  <fa(t,r)  =  (x  +  tv(x),g-  e-M*M*)) 

Then,  using  Lemma  4.4.1, 


[V,V](r)=  Jt 


V((j>u(t,r))  —  U((j)v{t,r )) 


t= o 


d_ 

dt 


[v(x  4-  tu(x)),  -ge  +  fu(x))u(x  +  fw(x))] 


t=o 

d_ 

dt 


[u(x  +  tv(x)),  —ge  tu^x">v^ui(x  +  tv(x))u(x  +  tu(x))] 


4=0 


=[Dv(x)u(x),g(u(x)u(x))(u(x)v(x))  -  g(Du>(x)u(x))v(x)  -  gw(x)(Dv(x)u(x))] 

—  [Du(x)v(x),g(u(x)v(x))(u(x)u(x))  +  g(Du(x)v(x))u(x)  +  gu>(x)(Du(x)v(x))] 
=[Dv(x)u(x)  —  Du{  x)v(x),  —gu(x)(Dv(x)u(x)  —  Du(x)v(x))] 

+  [0,  <7((w(x)u(x))(u>(x)u(x))  —  (w(x)v(x))(ui(x)u(x)) 

+  (Du>(x)v(x))u(x)  —  ( Duj(x)u(x))v(x ))] 

=[Dv(x)u(x)  —  Du(x)v(x),  —g  •  u(x)(Dv(x)u(x)  —  Du(x)v(x))\ 

-f  [0,  g  ■  ([  u}(x)u(x),cj(x)v(x)  ]  —  du)(x)(u(x),  u(x))].  (5.3.23) 


It  is  clear  that  in  (5.3.23)  the  first  term  is  the  horizontal  part  of  [  Z7,  V  ](r)  and  the 
second  term  is  vertical  part  of  [  U,  V  ](r)  which,  at  ( x,g ),  is  identical  to  the  minus  of 
right-hand-side  of  (5.3.20).  | 

Remark  5.3.4:  If  one  assumes  G  acts  on  Q  to  the  right  in  the  definition  of  principal 
fiber  bundle,  f 1  is  of  the  form 

f l(x)(ux,  vx)  =  du(x)(ux ,  vx)  +  [  iB(x)(ux),  u j(x)(vx)  ]  , 


which  is  called  local  curvature  form  in  many  textbooks  in  physics  and  mathematics.  | 
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We  now  start  to  drop  the  right-hand-side  of  (5.2.25)  down  to  the  base  space  B . 
The  first  term  there  can  written  as,  using  Exterior  Force  Hypothesis  5.2.3, 

F(r,v r  +  [£(r)]Q(r))  •  ur  =F((x,g),  [»*,$(£(»  -  u(x)  ■  vx])  •  [ ux ,  -gZ(x)  •  ux } 

=  F(( x,  e),  [vx,  £(x)  -  u{x)  •  vx])  •  [ux,  -w(x)  ■  ux] 

=  F(x,vx)-ux.  (5.3.24) 

We  give  the  drop  of  the  second  and  third  terms  on  the  right-hand-side  of  (5.2.25) 
in  the  following  Lemmas. 

Lemma  5.3.5:  Let  f  =  (x,g)  be  any  point  in  Q  for  any  point  x  G  B  and  g  G  G ,  and 


Uf  =  (ux,  -g  •  (w(x)  •  ux))  and  vf  =  (%,  -g  •  (w(x)  •  vx)) 
be  horizontal  tangent  vectors  at  r  with  respect  to  the  connection  given  in  (5.3.2).  Then, 
duz(r)(ur,  Vf)  =du^(x)(ux,  vx)  -f  (I(x)f(x),  Q(x)(ux,  vx)),  (5.3.25) 

where  is  a  one-form  on  B  defined  by,  for  any  z  6  B  and  wz  6  TZB , 

ui{z)(wz)  =  K(z,  e)([0,  £(*)],  [w2,  -u(z)wz]).  (5.3.26) 

Proof: 

As  we  have  done  in  the  proof  of  Proposition  5.3.3,  let  U  and  V  be  the  vector  fields 
extended  from  tangent  vectors  uf  and  vf ,  respectively,  such  that  at  any  point  r  in  Q , 
their  expressions  are  given  in  (5.3.22).  From  (4.1.3), 

duz(r)(U(r),V(r))  =  tf(r)[wc(r)F(r)]  -  V(r)[u^r)U(r)]  -  w€(r)([  U,  V  ](r)).  (5.3.27) 


From  the  definition  of  u ^  in  Theorem  5.2.5, 

ut(r)V(r)  =  A'(r)([^(r)]g(r),  V(r)) 

=  K(x,g)([0,g-  f(x)],  [v(x),  -g  ■  (w(x)  •  v(x))]). 

Then,  letting  (f>u(€,r)  =  [x  +  eu(x),£re_e^x^^^]  be  the  tangent  curve  of  U  at  r  =  (x,g), 
we  have 


U(r)[u>^(r)V(r)} 


d_ 

de 


^(d>c/(€,r))Vr(d>!7(e,r)) 

e=0 
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[v(x  +  eu(x)),  -ge  eu(~x)u(x')^x  +  eu(x))v(x  +  eu(x))}) 

=  K(x  +  €u(x),e)([0,J(x  +  eu(x))], 

®e  e=0 

[u(x  +  eu(x)),  — £>(x  +  eu(x))v(x  +  eu(x))]) 

=  D(c3f(x)  •  u(x ))  •  v(x ) 

+  K(x,  e)([0, ?(*)],  [Dv(x)  ■  u(x),  -u(x)(Dv(x)  •  «(*))]).  (5.3.28) 

Following  the  same  way,  we  get 

V(r)[u>t(r)U (r)]  =  D(u^(x)  ■  v(x))  •  u(x) 

+  iif(®,e)([0,f(x)],  [Du(x)  ■  v(x),  -u(x)(Du(x)  •  «(*))]).  (5.3.29) 

The  Lie  bracket  of  vector  held  U  and  V  has  been  computed  in  (5.3.23).  Then 

ue(r)<{U,V](r)) 

=  iiT(x,e)([0,£(x)], 

[. Dv(x )  •  u(x)  —  Du(x)  ■  v(x),  —u(x)(Dv(x)  ■  u(x)  —  Du(x)  ■  v(x))]) 

+  Z(x,e)([0,^(x)],[0,?2(x)(u(x),u(x))]).  (5.3.30) 

Substituting  (5.3.28)-(5.3.30)  into  (5.3.27),  we  get 

doj^(r)(U(r),V(r))  =  cfw5(x)(tt(x),  v(x))  +  (I(x)f(x),  fi(x)(u(x),  v(x))). 

Evaluating  the  above  equation  at  r  =  f,  we  get  (5.3.25).  I 

Lemma  5.3.6:  Let  r  and  uf  be  defined  as  in  Lemma  5.3.5.  Let  [£(h)]g(f)  =  [0,  p-£(x)] . 
Then, 

du^(f)(ur ,[^(r)]g(f))  =  {(Dz I(x)  •  uz)£(x),  f(x))  +  (I(x)f(x),  Dx£(x)  ■  ux) 

+  (  I(*)J(*),[w(x)tt*,^(x)  ]  ).  (5.3.31) 

Proof: 

Let  U  be  the  same  vector  held  given  in  (5.3.22).  Let  be  the  vector  held  extended 
from  £q(x)  such  that,  at  any  point  r  in  Q  , 


6}(0  =  [Adg£(x)]Q(x, g )  =  [0,  Ad-gt(x)  ■  g]. 


Again,  from  (4.1.3),  we  have 


<L}£(r)(U(r),ZQ(r))  =  F(r)[w€(r)fg(r)]-fQ(r)[«€(r)l/'(r)]-fa;€(r)([i7,fQ](r)).  (5.3.32) 
Let 

4>u{*,  r)  =  [x  +  eu(x),  and  <j>gQ  (e,  r)  =  [x,  e€Ads^x)g] 

be  tangent  curves  of  vector  fields  of  U  and  £q  at  (x,g),  respectively.  Then  following 
the  same  procedure  as  we  have  done  in  the  proof  of  previous  Lemma,  we  have 

U (r)[w* (r)(£j(r))]  =  r))(£Q(<f>u)(et  r)) 

ae  e=0 

=  ~  K(x  +  €u(x),ge-^x'>u^)([0,ge-€Z^x'>£(x  +  eu(x))], 

<e=o 

[0  ,Ad^{x)ge~^x^\) 

=  X  K(x  +  eu(z)>e)([0>£(x  +  «“(*))]» 

d*  £— ~o 

[0  ,e^(x)u(x)g-lAd-al{x)ge-^x)u^]) 

-  (L>I(x)f(x),  Adg- ijf(x))  +  (I(x)J5f(*)  •  «(*),  Adg-i-g£(x)) 

+  (I(*)f(*),  (w(x)u(x))Ada-i3|(x)  -  Adg-ig£(x)(uj(x)u(x)))  (5.3.33) 

and 

£Q(r)[ui(r)U{r)]=  ^d^Q((,r))U(<p^Q{€,r)) 

°€  £  =  0 

=  y  ii:(x,c^.-1«*^)([0,c<^.-*«*)^*)],[tt(x),-ee^.-1«*)fira;(x)tt(x)]) 

e=0 

=  0.  (5.3.34) 

Finally,  using  Lemma  4.4.2,  we  have 

[U,£q](r)=  £q{.4>u(€,t))-  U(<j>zQ(e,r)). 

Ue  e=0 

But 

f  Zq(M<))  =  T  [0,  Adg-xl{x)ge-^x^) 

ae  e=0  ae  €=0 

=  [0,-srAd5-i5^(x)(w(x)u(x))] 

and 

=  4 T  [u{x),-eeAd^x)g{Zi{x)u{x))\ 

ae  e=0  ae  f=0 

=  [0,-fifAd5-i5^(x)(S(x)u(x))]. 
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Therefore 


[U,£q  ]  =  0.  (5.3.35) 

Substituting  (5.3.33)-(5.3.35)  into  (5.3.32)  and  evaluating  it  at  r  =  f,  one  gets  (5.3.31). 

I 

Finally  the  last  term  in  (5.2.25)  can  be  given  as  follows.  First,  at  any  point 
r  =  (x,g)  in  Q, 

A  I 

£(r  +  euT) 


_  d 

-  CL 

de 


€  =  0 


g  .  ^  +  ^  .  eMx)«*  5-i 


e=0 


=  17  •  (-(w^u^x)  +  ^(x)(w(x)ua:)  +  Dx£(x)  ■  ux)  ■  g 

=  S-([4(:c),£(z)u;r]  +  Dx£(x)  •  Ux)  -g'1 
which  is  a  (/-valued  vector.  Then 


-l 


[D£(r)  ■  ur]Q(r)  =  [0,ff  •  ([4(x),w(x)ux  ]  +  Dx£(x)  ■  «*)]. 


Therefore, 

H(r)(ttr,vr)  =  K(x,  g)([vx,  -g(u(x)vx)],[0,  g  ■  ([f;(x),L>(x)ux]  +  Dx^(x)  •  «*)]) 

=  K(x,  e)(K,  -(w(x)u,.)],  [0,[f(x),w(x)ur  ]  +  Dx^(x)  •  tt*]) 

=  E(x)(ux,vx).  (5.3.36) 


By  collecting  (5.3.17),  (5.3.24),  (5.3.25),  (5.3.31)  and  (5.3.36),  Theorem  5.2.5  leads 
to  the  following  result. 

Theorem  5.3.7:  If  q(- )  =  (x(-),g(-))  is  a  solution  of  constrained  dynamic  equations 
(5.2.5)  on  principal  fiber  bundle  (B  X  G,B,tt,G)  with  connection  in  (5.3.2),  then  its 
projection  x(-)  in  B  satisfies  unconstraint  equation 


dt 


D2L^(x,  vx)  ■  ux 


D\L  (x,  u^,)  *  ux  — F(x,  vx )  •  ux  T  du^^(x)(ria;,  vx ) 

+  T(x)(ur,  ux,  vx)  +  (I(x)f  (x),  Sl(x)(ux,  vx)) 


+  ((ArI(*)  •  «*)£(*),  4(*)) 


+  (I(z)4(z),-Dx4(x)  •  ux) 

+  (I(x)4(*),[w(x)ux,f(x)]) 


-  E(x)(ux,vx) 


(5.3.37) 
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for  any  ux  6  TXB ,  where 

LHx,vx)  =  iA'(x)K,Wl)-(y(x)+i{f(xK(x)J(x))) 

fi(x)(ux,  vx )  =  du>(x)(ux,  vx )  -  [  u>(x)  •  ux,  uj(x)  •  ] 

r (x)(vx,ux,vx)  =  K(x,  e)([vx,  — u>(x)  •  vx\,  [0,  £l(x)(ux,  n^)]), 

E (x)(ux,vx)  =  K(x,e)([vx, -(5(x)n*)],  [0,[£(x),  ^(x)?^  ]  +  Dx£(x)  ■  ux\) 

and  F  and  are  defined  in  (5.3.24)  and  (5.3.26),  respectively. 

Remarks  5.3.8: 

(1)  If  we  define  a  function 


L(x,vx)  =  ^  K(x)(vx,vx)  -  V(x), 


by  considering  (5.3.16),  the  dynamic  equation  (5.3.37)  can  be  replace  by 


—D2L(x,vx)  •  ux 


D1L(x,vx)  ■  ux  =F(x,vx)  ■  ux  +  duj^{x)(ux,  vx) 

-)~  r(x)(nI,  ux ,  vx )  T  (H(^; )^(a- )>  ^(®)(^ar>  ^r)) 
+  \  ((Dxl(x)  ■  ti*)£(x),  l(x)) 

+  (I(x)f(x),[w(x)ux,^(x)]) 

-E  (x)(ux,vx).  (5.3.38) 


(2)  If  £(x)  =  0,  Equation  (5.3.37)  becomes 

d 

I-) 2 *  Ux  D\L[x ,  Vx )  *  —  ■^(•Ey  vx)  ’  ^ x  +  ^xi  ^zr)y  (5.3.39) 

which  is  the  equation  derived  in  [20]. 

(3)  As  long  as  x(-)  is  determined  from  (5.3.37)  or  (5.3.38),  its  horizontal  lift  r(-)  can 
be  found  from  solving  (5.3.10),  and  the  solution  of  original  constrained  equation 
(5.2.5)  can  be  found  by  using  the  formula  in  Remark  5.2. 6(3).  This  procedure  is 
called  reconstruction  in  [26]. 

(4)  If  one  assumes  G  acts  on  Q  on  the  right  in  the  definition  of  principal  fiber  bundle 
and  simple  mechanical  system  with  symmetry  and  the  mapping  £  :  Q  —*  Q'  defined 
in  Constraint  Hypothesis  5.2.1  is  G-equivariant  with  respect  to  right  action,  i.e., 


138 


£(q-g)  —  Adg-i£(q),  then  to  obtain  the  reduced  dynamic  equation,  one  only  needs 
to  change  signs  in  front  of  all  the  Lie  brackets  in  (5.3.37)  or  (5.3.38). 

(5)  One  can  check  that  if  G  is  Abelian,  (5.3.37)  or  (5.3.38)  specializes  to  (5.1.9).  | 


5.4  Lagrangian  Reduction 

In  Section  5.2,  we  formulated  constrained  Lagrangian  dynamics  on  horizontal 
distribution  for  any  principal  connection,  by  which  the  nonholonomic  constraints  are 
constructed  (cf.  Constraint  Hypothesis  5.2.1).  Here,  we  consider  a  special  connection, 
namely,  mechanical  connection  which  has  been  considered  in  Section  4.1. 

Recall  that,  given  a  simple  mechanical  system  with  symmetry  (Q,  K,V,G)  with 
its  Lagrangian  given  in  (5.2.2),  the  lift  of  the  G-action  to  TQ  induces  an  equivariant 
momentum  map  J  :  TQ  —>■  Q*  given  by 

(J(9,w9),0  =  K(q)(vq,CQ(q)),  vc  eg.  (5.4.1) 

We  assume  that  the  exterior  force  acting  on  the  system  satisfies  (5.2.6)  and  leaves  the 
momentum  map  invariant,  i.e.,  the  motion  of  the  system  q(-)  =  {q(t),t  >  0}  satisfies 
J(qq  <i)  =  l1  for  a  constant  /x  €  Q* .  If  /x  is  a  regular  value  of  J ,  the  interesting  question  is 
what  the  dynamics  on  subspace  S  —  J-1(/x)  C  TQ  is.  In  Section  5.1,  we  have  considered 
the  same  problem  by  treating  the  conserved  momentum  map  as  the  constraints  of  the 
system  with  Abelian  symmetry.  Here,  although  the  symmetry  group  is  non- Abelian  in 
general,  we  will  follow  the  same  idea  as  we  had  in  Section  5.1,  but  using  the  geometric 
method  we  developed  in  Sections  5.2  and  5.3. 

Consider  a  principal  fiber  bundle  given  by 


P  =  (Q,5,7T,G^), 


(5.4.2) 


where  =  {g  €  G  |  Ad*_x/z  =  fi} ,  for  constant  /j,  £  Q*  given  above,  is  an  isotropy 
group.  Let  be  the  Lie  algebra  of  G ll.  As  we  have  shown  in  Section  4.1.2,  on  this 
bundle,  the  mechanical  connection  is  given  by  a  -valued  one-form: 

oj(q)  :  TqQ 


vq  ^  IMI(<?)J(?^9),  VqSQ, 


(5.4.3) 
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where  IM(g)  :  —>  £/*  is  called  //-locked  inertia  tensor,  which  is  defined  in  the  same 

way  as  we  did  in  (4.1.9),  but  restricted  on  Q ^ .  Now  the  subspace  S  —  J_1(/x)  can  be 
represented  as 

s  =  {  (?,*,)  G  TQ  I  u,(g)(t;,)  =  },  (5.4.4) 

which  is  of  the  same  form  as  (5.2.4)  in  Constraint  Hypothesis  5.2.1.  Indeed,  for  any 
9  G  Qfi  5 

1^(5  •  q)p  =  I ■  q)Ad*gfx  =  I ~\g-  q)Ad*3(q,vg ) 

=  2^(5  •  •  (?, v?))  =  w(5  •  q)(Tq$g  •  t»,) 

=  Adgu(q)(vg)  =  ,4d5  I^X(?)A*- 

With  the  above  setting,  formulation  of  dynamic  equation  on  J-1(//)  can  be  given 
by  (5.2.5)  with  £(q)  —  I ~1(g)//.  Following  Theorem  5.2.4,  we  can  also  write  down  the 
dynamics  of  the  constrained  system  on  the  horizontal  distribution.  Since  the  connection 
now  is  a  special  one,  Equation  (5.2.25)  can  be  simplified.  We  first  show  the  following 
results. 

Proposition  5.4.1:  In  Theorem  5.2.5,  if  £(g)  =  I~1(q)n, 

(1)  the  function  V^(q)  is  the  amended  potential  function  V^(q) ; 

(2)  the  one-form  is  the  /z -component  of  mechanical  connection  form; 

(3)  For  mechanical  connection, 

H(r)(v)sO. 


Proof:  (1)  Indeed, 

vHq)  =v(q)  +  \  K(q)([Kl(q)ph(q)^K1(q)ph(q)) 

=v(q)  +  \{Mq)-K1(q)^K\q)p) 

=v(q)  +  \(^  K'(q)p) 

=  V,(q). 

(2)  The  //-component  of  a  A; -form  ak  E  ruk(Q-,Q^)  is  a  real-valued  A; -form  on  <5, 
denoted  by  ak ,  and  is  defined  by 

(«£(?)»  w»>  =  ( n,ak(q)(vq )).  (5.4.5) 
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From  the  definition  of  mechanical  connection  given  in  (5.4.3),  we  have 


{P,u(q)(v ,))  =  (/i,I"1(g)J(^,n9)) 

=  flt1  («)/*»  J(«»«g)> 

=  K(q)((T^(q)ft)Q(q)tvq) 

Therefore,  from  (5.4.5),  the  //-component  of  the  mechanical  connection  form  is 

«,(»)  =  (5-4.6) 

From  the  condition  of  this  Proposition  and  the  expression  of  u ,  (2)  is  proved. 

(3)  Since  horizontal-vertical  splitting  of  mechanical  connection  is  an  orthogonal 
splitting  with  respect  to  the  Riemannian  metric  K,  the  claim  follows  from  the  definition 

°f  “(r)(‘,  •)  •  ■ 

After  identifying  as  //-component  of  mechanical  connection  form  coM ,  we  have 
the  following  properties. 

Proposition  5.4.2: 

(1)  For  a  fixed  point  q  G  Q  and  any  vector  field  Y  G  ®(Q), 

d^(q)mq))Q(q),Y(q))  =  0;  (5.4.7) 

(2)  du,j.  is  the  //-component  of  the  curvature  form,  of  the  mechanical  connection. 
Proof: 

(1)  First,  we  extend  the  tangent  vector  [£(<?)]q(?)  to  a  fundamental  vector  field  (at  q) 
£q(q),  where  f  =  £(q).  Then  we  have  formula 

<^;m(|q,y)  =  £q  -wM(y)  -  y  -wm(6q)  - ^M([|g,y  ]) 

=  (/i,|Q  -o;(y))  -  (//,y  •«(&)>  -  (/*,«([  fQ,y  ])).  (5.4.8) 

The  second  term  in  (5.4.8)  is  always  zero  since  co(£q)  =  £  is  a  constant  in  ^ .  If  Y 
is  horizontal,  so  is  [  £q,Y  ]  (cf.  [33]).  This  implies  oj(Y)  —  co([  {;q,Y  ])  =  0,  and 
consequently  dwM(£g,y)  =  0.  If  Y  is  fundamental,  say  Y  =  Cq  f°r  some  (  6  the 
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first  term  of  (5.4.8)  is  zero  since  (j(Y)  is  a  constant.  In  addition, 

{pMitQiY]))  =  -(/*,w(U,C]fl)) 


d_ 

dt 

d_ 

dt 

d_ 

dt 


AdeXp(t£)C) 

1=0 

(Ad*exp(t0v,C) 

1=0 

<M,C)  =  0. 

1=0 


This  completes  the  proof  of  (1). 

(2)  From  the  structural  equation  in  (4.1.4),  we  have 


=  <jjLt[u(X)MY)])  +  (n,MX,Y)). 

But  (/i,[cj(X), u(Y)  ])  =  0  by  the  proof  of  (1)  above.  This  proves  the  result.  g 

Now  using  the  above  observations,  we  can  restate  the  Theorem  5.2.5  for  the  systems 
with  mechanical  connection  as  follows. 

Theorem  5.4.3:  If  q(-)  is  the  motion  of  a  Lagrangian  system  with  symmetry  and 
preserves  the  equivariant  momentum  map  J(q,  vq)  —  /x,  then  its  restriction,  r(-),  on  the 
horizontal  distribution  determined  by  the  mechanical  connection  satisfies  the  dynamic 
equation 


—D2Lll(r,vr)-ur-DiLtl(r,Vr)-ur  =  F{r,vr  +  [ IM  (r)]Q(r))-ur  +  flM(r)(ur,nr),  (5.4.9) 


where 

Lf,(r,vr)=  ^K{r)(vr,vr)  -  (V(r)+  |(M»IjT1(r)M))  (5.4.10) 

and  is  /i -component  of  the  curvature  form  of  the  mechanical  connection.  ■ 

Having  dynamic  equation  on  the  horizontal  distribution,  we  intend  to  drop  the 
dynamics  of  the  system  down  to  the  base  space  Q/G To  this  end,  we  first  define  a 
lift  mapping  from  curves  in  base  space  to  curves  in  total  space.  Let  q(-)  =  {q(t),t  >  0} 
be  a  curve  in  Q  and  x(-)  be  the  bundle  projection  of  q(-),  i.e.,  x(-)  =  7r(g(-)).  Define  a 
mapping  (cross-section) 

P-Q/G^Q  (5.4.11) 
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such  that  r(-)  =  p(x(-))  is  a  horizontal  curve  in  Q ,  i.e.,  at  each  t  =  t, 

vr(t)  =  H(p(x(t)))  ■  x(t)  =  Tx(f)p(x(i))  •  x(t)  €  Hp(l(i)).  (5.4.12) 

If  the  principal  bundle  is  a  trivial  one,  the  mapping  'H(r)  has  been  given  explicitly  in 
Remark  5.3.1.  In  particular,  one  can  show  that  now  the  pull-down  connection  form  S(x) 
is  of  the  form:  I_1(a:)j(x),  where  the  map  j(x)  has  been  defined  in  Subsection  4.1.2. 

With  the  above  defined  mappings,  one  can  define  the  induced  Riemannian  metric, 
potential  function  and  p -locked  inertia  tensor  on  the  base  space  B  =  Q/G^  as  follows. 

K(x)(vx,wx)  =  K(p(x))(H{p(x))  ■  vx,H(p(x))  ■  wx)  Vvx,wx  £  TXB, 


(5.4.13) 

y(x)  =  (i70p)(*), 

(5.4.14) 

IM(x)  =  (I^op)(x). 

(5.4.15) 

Remark  5.4.4:  The  above  objects  are  well  defined  since  they  all  are  -invariant.  | 
With  these  newly  defined  objects  in  B ,  the  Lagrangian  L M  in  (5.4.10)  can  also  be 
induced  on  B: 

L»(r,vT)  =Lfl(p(x),H(p(x))-vx) 

=  \k(p(x))Wp(x))  ’  vx,n(p{x))  •  Vx) 

-(V(p(x))+1-(p,I-,\p(x))p)) 

=  t;K(x)(vx,vx)  -  (F(x)  +  ( X)P )) 

=  L,(x,vx),  (5.4.16) 

which  is  a  function  defined  on  T{Q/G^). 

Now  we  are  ready  to  express  (5.4.9)  on  the  base  space  Q /G^ .  Let  ur  =  ?i(p(x))-ux 
for  any  ux  £  TXB .  Then,  the  first  term  on  the  left-hand-side  of  (5.4.9)  is 

<^D2Lll(r(t),vr(t))  -ur  =  ^ K(r(t))(vr(t),ur ) 

=4-  K(p(x(t))){-H(p(x(t)))  ■  vx,  H{p{x{t)))  ■  ux) 

dr  T =t 

=  ^^(x(t))(ux(t),  ux)  -  K(p(x))(H(p(x))  ■  vx,  ^ H(p(x{t )))  •  ux) 

x,vx)  ■  ux  -  [/f(r)(ur,  (Dq'H(r)  ■  vr )  •  ux)]\r>Vr,  (5.4.17) 


143 


where  [  •  ]|r)Ur  =  [  •  ]| r=p(x),vr=n(p(x))-vI  ■  The  second  term  of  (5.4.9)  can  be  expressed 
explicitly  as 

vr)  •  ur  —  —  [DrK(r)  •  txr  )(ur,  vr )  DrV  (r)  •  uT  ~  {/x,  (Z)r  (r)  •  ur)p) . 

Here 

d 

(DrK(r)  •  ur)(vr,vr)  =—  K(r  +  eur)(vr,vr) 

d(  e—0 

J 

=  —  K(p{x  +  eux))(H(p(x))  ■  vx,H(p(x))  ■  vx) 
ae  £=0 

[A"(x  4~  Vj) 

-  K(p(x))(H(p(x  +  eux))  ■  vr,H(p(x  +  eux ))  •  vr)] 

—  (*^)  *  '^tx){^xi  V X ) 

-  2[jRf(r)(vr,(I>,W(r)  •  xxr)  •  Ur)]|r,Ur,„r , 

where  [  •  ]\T,ur,vr  =  [  •  ]|r=p(*),ur=«(p(*)).t.„wr=w(/»(*))-«.  •  and>  following  the  same 
approach,  we  have 

DrV(r)  •  ur  =  DXV( x)  ■  ux 

)  *  Ux  — -  D XH fj'(x')  ‘  Ux. 

Then,  the  second  term  of  (5.4.9)  now  is  of  the  form 

DrL^(r,  vr)  ■  ur  —  D\L ^(x ,Vx)  •  ux  —  [A  (r)(nr,  (Dq7~l(r')  ■  ur)  vx)]\r<Ur  tVr .  (5.4.18) 


d_ 

dC  6  =  0 


From  (5.4.17)  and  (5.4.18),  the  left-hand-side  of  (5.4.9)  becomes 
^ 7^2 A^(r(t),'U,.(t))  •  tx,.  D rL •,  vr)  'XX,.  — 

D2  L  ^  (x ,  vx)  •  U/ x  (x,  nx)  rx^,  — {-*  F(x)('ixx,xix,'Ux), 

where 


(5.4.19) 


r(x)(n;c,xi;r,nx)  =  [A'(r)(nr,(Z?q7f(T-)  •  xxr)  •  -  (DqH(r)  ■  vr )  •  xxx)]|r,Ur>Vr . 

From  the  previous  section,  we  have  known  that  if  the  principal  fiber  bundle  is  a  trivial 
bundle, 

[(DqH(r)  ■  ur)  ■  vx  -  ( DqH(r )  •  vr)  •  xxx)]|r,Ur,„r  =  V er([H{p(x))  ■  ux,'H{p(x))  •  ux]). 
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Note  that  a  principal  bundle  is  locally  trivial.  Thus,  the  above  result  is  also  true 
in  general.  Since,  for  mechanical  connection,  the  horizontal-vertical  splitting  is  an 
orthogonal  one  with  respect  to  the  Riemannian  metric  K ,  immediately,  we  have 

r(a:)(t)2:,ua;,  vx)  =  0.  (5.4.20) 

Return  to  the  right-hand-side  of  (5.4.9).  Recall  that  is  the  /i-component  of 
curvature  form  of  the  mechanical  connection  defined  by 


ttn.(<l)(vq,wq)  =  (n,n(q)(vq,wq)),  \/vq,wq  €  TqQ. 

Since  fl  is  GM-equi  variant,  is  G -invariant.  Therefore,  using  the  lift  mappings  in 
(5.4.11)  and  (5.4.12),  we  can  define  a  two-form  on  base  space  B  by 

&M(x)(ux,  vx )  =  Q,IJ.(p(x))(H(p(x))  ■  ux,  H{p(x))  ■  vx).  (5.4.21) 


Finally,  using  the  G^ -equi  variant  property  of  the  exterior  force  (cf.  (5.2.6)),  we  can 
define  exterior  force  on  T5  by 

F(x,vx)-ux  =  F{p(x),n(p(x))  ■  vx  +  [i/ (x)p]Q{p(x)))  ■  {Ti{p(x))  •  ux).  Vux  £TXB 

(5.4.22) 

From  (5.4.19)  -  (5.4.22),  we  have  the  following  theorem. 

Theorem  5.4.5:  If  q(-)  is  a  motion  of  Lagrangian  system  with  symmetry  and  keeps 
momentum  map  conserved,  then  its  projection  x(-)  in  Q/G ^  satisfies  dynamic  equation 


^  B n (x ,  vFj  ux  B\  L ^  (x ,  vx^j  *  ux  —  F(x ,  vx )  •  ux  -j-  Hp(x)(,na,,  vx ) 


for  any  ux  E  Tx(QfG M),  where 

Mx>v*)  =  ^K(x)(vx,vx)  -  ( V(x)+  (x)p,)), 

F  and  are  defined  in  (5.4.22)  and  (5.4.21),  respectively.  | 

Remarks  5.4.6: 

(1)  In  the  literature  of  the  analytic  mechanics,  the  reduction  theory  for  Hamiltonian 
systems  has  been  well  developed  [26,28,29]  and  has  been  successfully  applied 
to  many  problems  in  engineering.  However,  since  in  many  physical  problems, 
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Lagrangian  dynamics  is  the  natural  starting  point,  the  construction  of  a  reduction 
theory  directly  applicable  to  Lagrangian  systems  becomes  a  challenge.  Recent  work 
done  by  Marsden  and  Scheurle  [27],  and  Bloch  and  Crouch  [7]  have  contributed  to 
this  goal.  In  [27],  the  reduction  is  realized  by  including  conservative  gyroscopic 
(magnetic)  force  into  the  variational  principle  in  the  sense  of  Lagrange  and 
d’Alembert.  The  main  advantage  of  this  approach  is  that  the  reduction  procedure 
is  directly  comparable  with  the  one  in  Hamiltonian  case. 

(2)  Once  the  motion  of  the  system  in  Q/G M  is  found,  the  motion  of  the  system  in  Q 
can  be  determined  using  a  reconstruction  procedure ,  cf.  [26]. 

(3)  Lagrangian  reduction  can  be  applied  to  study  the  stability  of  relative  equilibria  of 

various  mechanical  systems.  For  example,  also  see  [27].  | 

5.5  Examples 

To  illustrate  the  results  of  this  chapter,  we  apply  the  theorems  in  this  chapter  to 
some  representative  physical  systems. 

Example  5.5.1:  Reduced  dynamic  equation  for  planar  3-body  system. 

In  [34,43,44],  the  authors  investigate  the  dynamics  and  control  of  planar  multibody 
systems.  In  particular,  the  relative  equilibria  and  their  stability  are  investigated  using  the 
energy- Casimir  method.  Bifurcation  phenomena  are  explored  by  standard  methods  in 
Hamiltonian  mechanics.  Here,  following  [27],  we  shall  use  Lagrangian  reduced  dynamics 
to  compute  the  relative  equilibria  and  then  discover  bifurcations  via  the  movement  of 
the  eigenvalues  of  the  linearized  dynamics  at  fundamental  relative  equilibria. 

Consider  a  planar  three-body  system  shown  in  Figure  5.5.1.  The  configuration 
space  now  is  51  X  51  x  S 1  and  the  symmetry  group  is  S'1 .  Since  51  is  Abelian,  we  only 
need  to  use  the  formula  in  Section  5.1  for  reduction.  The  Lagrangian  of  this  system, 
after  ignoring  the  potential,  can  be  written  as 

L(<f>J,03)  =  ^4>T  Kn(f>)f>  +  4>T  Kn(cf)03  +  \  K  22  (<£)(  03  )2,  (5.5.1) 

where  <fi  —  (<f\ ,  <fo)T  -  For  the  sake  of  simplicity,  we  assume  the  planar  moments  =  1 
and  the  masses  m;  =  3 ,  for  i  =  1, 2, 3 .  And,  d2\  =  d2 3  =  d32  =  1,  but  d22  =  l  >  0 
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Figure  5.5.1  A  planar  three-body  systems 


which  is  the  only  variable  parameter  for  the  system.  Then,  the  matrices  in  (5.1.18)  are 
of  the  form 


j'  (x\  _  (  6/2  +  1  9lcos((pi )  +  6/2  +  1  \ 

11  {<p>  ~  9 lcos(4>x)  +  6/2  +  l  lSlcos(<f>i)  +  6 Z2  +  20  ) 

K  /±\_(  3lcos(<f>2  +  <pi)  +  9lcos(<j>i)  +  6/2  +  1 

Vl2^  \  3lcos(<f>2  +  <f>i)  +  9cos(<£2)  +  !8lcos(4>i)  +  6Z2  +  20 

/V22(0)  =  Qlcos(<j>2  +  4>\)  +  18  cos(4>2)  +  18/cos(0i )  +  6/2  +  27. 


Using  the  above  matrices  the  reduced  Lagrangian,  /x- component  of  the  curvature  form 
and,  consequently,  the  reduced  dynamics  can  be  easily  determined  from  (5.1.14)-(5.1.16). 
In  particular,  from  (5.1.14),  the  equilibria  of  the  reduced  system  are  determined  by  the 
critical  points  of  K22  which  is  just  the  locked  inertia  of  the  system.  These  equilibria 
are  the  relative  equilibria  of  unreduced  system.  It  is  readily  seen  that  there  are  at 
least  four  critical  points  for  K22,  that  is,  4>  =  {(0,  0),  (0,  tt),  (it,  0),  (x,  7r)}  ,  which  are 
called  fundamental  relative  equilibria  [43].  The  stability  of  these  equilibria  can  be  easily 
determined  by  looking  at  the  spectrum  of  linearized  equation  of  reduced  dynamics.  In 
addition,  we  can  also  see  the  movement  of  the  spectrum  in  response  to  change  of  certain 
parameters  (e.g.,  /)  and  the  bifurcation  points.  In  the  following,  we  study  the  linearized 
equation  at  (7T ,  tt)  only. 
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One  can  show  that  the  linearized  equation  at  (tt./t)  of  the  reduced  dynamics  is 

M<j>  +  Ad>  =  0,  (5.5.2) 


where 


(  24  Z4  -  48  Z3  +  52 Z2  -  32 Z  +  24  12  Z4  -  48  Z3  +  62 Z2  -  28 1  -  6  \ 

\  12  Z4  -  48  Z3  +  62  Z2  -  28  Z  -  6  66  Z4  -  276  Z3  +  505  Z2  -  452  Z  +  177  ) 


and 


ZM2  \ 

Z  /i2  —  3  /z2  y  ‘ 


Then,  the  eigenvalues  of  this  system  are  the  roots  of  the  equation 


det{\2M  +  A)  =  0. 


/ 

'  c 

c  ; 

/ 

‘  c 

< 

-x — X — 

— X — X"  >  _ a 

r  ...  \/  ^  N, 

/x  /N  A  J 

6  *  - x — 

> 

/ 

— X - 

r 

0<  / < 2 


7  =  2 


/  >  2 


Figure  5.5.2  Eigenvalues  of  linearized  equation 

Figure  5.1.2  shows  the  positions  of  four  eigenvalues  in  the  complex  plane  for  three 
different  values  of  Z.  Since  the  eigenvalues  of  the  linearized  system  (5.1.19)  are  also  the 
eigenvalues  of  Hamiltonian  system 

q  -  M~1p 

V  =  -M 

for  p  =  Mq,  following  the  analysis  in  [34],  bifurcation  occurs  at  Z  =  2  necessarily.  In 
fact,  one  can  see  this  bifurcation  from  the  contours  of  amended  potential  function  shown 
in  Figure  5.1.3.  | 
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(a)  l  =  1.8 


(b)  l  =  2.5  (near  (7r,ir)) 


Figure  5.5.3  Contours  of  amended  potential  of  planar  3-body  system 


Example  5.5.2:  Rolling  a  homogeneous  sphere  on  a  rotating  horizontal  plane  [32]. 

Let  0-XYZ  be  a  coordinates  system  fixed  in  inertial  space.  Let  a  homogeneous 
sphere  roll  without  sliding  on  a  plane  (or  platform)  which  rotates  about  the  Z  axis  with 
a  constant  angular  velocity  C ,  see  Figure  5.5.4. 


Since  the  sphere  will  not  move  along  the  Z  direction,  its  configuration  space  is 
Sft2  x  50(3)  which  will  be  parametrized  by  (p,  A),  where  p  =  (x,y)  gives  the  location 
of  center  of  the  sphere,  or  the  contact  point  of  the  sphere  and  the  plane,  A  gives  the 
orientation  of  the  sphere  relative  to  inertial  space  or  coordinates  system  0-  XY Z .  Let 
C  =  (ux,(jjy,u2)T  be  the  vector  of  angular  velocity  of  the  sphere  in  the  inertial  frame. 
Then  the  Lagrangian  of  this  system  is 


L  =  ^m(x2  +  y2)  +  +  w2  +  u\) 

=  \m(i2  +  f)+  ^mk2(CAXA). 


(5.5.3) 


The  rolling-without-sliding  constraint  on  the  sphere  is  given  by  two  equations: 


x  —  aojy  +  Cy  =  0 
y  +  aux  —  Cx  =  0. 
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z 


Figure  5.5.4  Rolling  a  homogeneous  sphere  on  a  rotating  plane. 


We  also  assume  that  there  is  an  exterior  force,  denoted  by  F  =  (Fx,  Fy) ,  acting  at  the 
center  of  the  sphere  and  along  any  direction  perpendicular  to  Z  axis,  where  Fx  and  Fy 
can  be  any  time  dependent  functions  on  T3?2 .  It  is  clear  that  since  there  is  no  torque 
along  the  Z  axis,  the  angular  velocity  of  the  sphere  about  the  Z  axis  is  a  constant, 
denoted  by  c.  As  we  did  in  the  second  part  of  Example  2.2.24,  we  assume  the  conserved 
uz  as  an  a  priori  constraint.  Then  the  constraint  equations  can  be  written  as 
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Then,  it  is  easy  to  check  that  the  left-hand-side  of  (5.5.5)  defines  a  connection  on  the 
right-principal  fiber  bundle  p  -  (R2  x50(3),  3R2, 7T,  50(3)) ,  the  right-hand-side  of  (5.5.5) 
satisfies  the  Constraint  Hypothesis  5.2.1  and  Lagrangian  in  (5.5.3)  is  G-invariant.  Since 
all  the  conditions  for  reduction  in  Section  5.2  and  5.3  are  satisfied  with  respect  to  the 
right  action,  we  can  use  Equation  (5.3.37)  or  (5.3.38),  taking  care  to  change  the  order 
of  vectors  in  all  Lie  brackets,  thereby  obtaining  the  reduced  dynamics  on  the  base  space 
SR2 .  Here,  for  convenience,  we  rewrite  (5.3.38)  with  respect  to  right  action  as  follows: 
for  any  p  G  B , 

^ 

— D2L(p,vp )  •  up  -  D\L(p,  vp)  •  Up  =F(p,vp )  •  up  +  dLj^(p)(up,  vp) 

+  r(p)(up,  Up,  Vp)  +  (I(p)t(p),  Q(p)(up,  Vp)) 

+  ±((DpI(p)-Up)tipU(p)) 

+  (I(pK(p),[^(p)^(p)up]) 

-  E(p)(up,vp),  (5.5.7) 

where 

L(p,vp)=  ^K(p){vp,vp)  -V(p), 
f \p)(up,  Vp)  -  du(p)(up,  vp)  +  [  u(p)  ■  up,u(p)  ■  vp  ] , 

T(p)(vp,  Up,  Vp)  =  K{p,e)([vp,-u(p)-vp],[0,n(p){up,Vp)}), 

E(p)(up,Vp)  =  K(p,e)([vp,-{u(p)vp)],[Q,[u(p)up,Z(p)]  +  D P£(p)  •  up]) 
and,  without  change,  F  and  are  defined  as  in  (5.3.24)  and  (5.3.26),  respectively. 

Let  M  =  Diag(m,m,mk2 ,mk2 ,mk2) .  Then  L  in  (5.5.7)  is 

L  =  ^v^lv(p)t  Mu(p)vv  =  ~ T~~(^2  +  y2)-  (5.5.8) 

The  other  terms  in  (5.5.7)  can  be  written  as  follows.  Let  up  =  (ux,uy).  Then 

F(p,  Vp)up  —  Fxux  -|-  F yUy.  (5.5.9) 

Since  in  this  case  K  and  u  do  not  depend  on  p,  from  (5.3.26),  for  any  w  =  ( wx,wy )  £ 
TVB , 

Z>t.(p)(w)  =  mk^C (ywx  -  xwy) 


151 


and  I(p)  =  mk2I3x3,  which  is  a  constant  matrix.  The  Lie  brackets  in  (5.5.7)  have  the 
following  form:  letting  {ei,e2,e3}  be  the  standard  basis  for  so(3), 

[u(p)up,u(p)vp  ]  =  y  (yu*  -  xuy)e 3 


and 


cuT 


[f,«0»)«p]  =  ~yei  + 


— e2  -  ~(xux  +  yuy)e3. 


Then,  we  have 


mk2C 

^p)  —  ^  ^ xV )* 

(5.5.10) 

r  (p)(vp,up,vp)  =  o, 

(5.5.11) 

,_r  TTlk^  C 

(i (p)t{p),n(p)(uP,v))  =  2  ( vu *  *“»)> 

(JL 

(5.5.12) 

((DpI(p)up)t(pU(p))  =  0, 

(5.5.13) 

(Hp)I(p),U(p)Mp)up])  =  o, 

(5.5.14) 

E(p)(up,  Up)  =  mk2(c  -  C)(yux  -  xuy). 

(5.5.15) 

Substituting  (5.5.8)-(5.5.15)  into  (5.5.7),  one  gets 


xux  +  yuy  =  (- 


k2C 


;V  + 


Fx,  ,  k2C  . 

-K  +  (  ,  .  .,»  + 


F, 


a2  +  k2*  '  a2  +  A:2  m  '  '  v  a2  +  k2  ‘  a2  +  A:2  m 

where  and  uy  are  arbitrary  real  numbers,  or  equivalently, 

k2C 


-)ux,  (5.5.16) 


x  =  — 


V  = 


a 2  +  k 2 

k2C 


y  + 


a 


:x  + 


a2  +  k2  m 
a 2 


’2  F 

1  V 


(5.5.17) 


a2  4-  k2  '  a2  +  k2  m 
If  C  =  0,  i.e.,  the  platform  does  not  rotate,  Equation  (5.5.16)  leads  to  Equation  (2.2.43) 
in  the  example  in  Chapter  II.  If  we  assume  Fx  =  Fy  =  0,  the  solution  of  (5.5.17)  is 


x(t)  =  ycos(ft)  +  ~-sin{lt)  -  y  +  x0 
y(t )  =  -y  cos(Ct)  +  ysm(£f )  +  y  +  y0, 


(5.5.18) 


where  l  =  and  (xo,yo)  and  (io,yo)  are  initial  position  and  initial  velocity  of 

the  center  of  the  sphere.  It  is  interesting  to  note  that  a  trajectory  of  the  center  of 
the  sphere  is  always  a  circle.  But,  if  we  fix  a  coordinates  system,  say  0-X\Yi  ,  on  the 
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rotating  platform,  the  trajectory  of  the  center  of  the  sphere  on  this  coordinates  system, 
or  the  trace  of  the  contact  point  of  the  sphere  on  the  platform,  can  be  very  complicated. 
Figure  5.5.5  shows  some  cases  for  different  choices  of  the  parameters  a  and  k .  In  these 
examples,  we  choose  x0  =  y0  =  0 ,  io  =  1  ,  Vo  =  0 ,  C  =  1 . 

Remark  5.5.3:  Equation  (5.5.17)  is  the  same  as  the  one  in  [32],  which,  however, 
was  derived  from  the  total  dynamics  using  so-called  quasi-coordinates.  In  addition,  this 
system  was  not  identified  as  a  Chaplygin  system  in  [32].  This  is  certainly  not  true  if  a 
non- Abelian  symmetry  is  considered,  as  we  have  shown  above.  I 
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(3).  a  =  l,k  =  0.5 


Y_1 

l 


=  3.1415926,  k  =  1  (0  <  *  <  4.09tt/J) 

ntact  point  of  sphere  on  the  rotating  plane,  (3)  and  (4) 
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CHAPTER  VI 


CONCLUSIONS  AND 
FUTURE  RESEARCH 


Up  to  now,  we  have  explored  various  issues  concerning  with  dynamic  modeling  and 
kinematic  control  for  constrained  mechanical  systems  with  symmetry.  The  fundamental 
mathematical  tools  we  applied  are  group  theory,  symplectic  and  Poisson  geometry,  the 
theory  of  reduction  in  geometric  mechanics  and  the  connection  theory  in  differential 
geometry. 

In  Chapter  II,  we  reformulate  Lagrange-d’Alembert  principle  with  constraints 
using  the  classical  notion  of  virtual  displacement  for  constrained  systems  and  a  modern 
treatment  of  constraints  by  distributions. 

In  Chapter  III,  we  rigorously  studied  the  kinematics  and  dynamics  of  a  particular 
mechanical  system  with  holonomic  constraints:  floating  four-bar  linkage.  We  have 
revealed  the  kinematic  and  dynamic  features  of  such  system  in  comparison  with  the 
systems  without  geometric  constraints. 

In  Chapter  IV,  we  formulated  a  kinematic  control  problem  for  the  system  consisting 
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of  a  rigid  body  with  two  oscillators.  From  a  Hamiltonian  viewpoint,  we  explored 
the  reduction  and  explicit  solvability  of  related  optimal  control  problems  on  principal 
bundles  with  connection.  The  necessary  conditions  for  the  optimal  control  problem  were 
determined  intrinsically  by  a  perturbation  method  and  a  Hamiltonian  formulation.  By 
identifying  the  structure  group  of  the  principal  bundle  as  a  symmetry  group  of  the  system, 
we  were  able  to  use  the  Poisson  reduction  procedure  to  reduce  the  order  of  differential 
equations  given  by  the  corresponding  necessary  conditions.  Under  suitable  hypotheses 
and  approximations,  we  found  that  the  reduced  system  possesses  additional  Abelian 
symmetry.  Applying  Poisson  reduction  again,  we  obtain  a  further  reduced  system  and 
corresponding  first  integral. 

In  Chapter  V,  we  turned  to  the  formulation  of  reduced  dynamic  equation  for 
nonholonomic  Lagrangian  systems  with  symmetry.  Under  our  hypotheses  on  constraints 
and  exterior  force,  we  showed  that  the  dynamics  of  a  nonholonomic  Lagrangian  system 
with  non- Abelian  symmetry  can  be  reduced  to  a  lower  dimensional  space  determined  by 
the  principal  fiber  bundle.  The  reduced  dynamic  equations  were  formulated  explicitly, 
i.e.,  without  Lagrange  multipliers.  This  formulation  generalizes  the  one  for  classical 
Chaplygin  systems  which  possess  Abelian  symmetry,  and  the  one  having  non- Abelian 
symmetry  but  with  linear  constraints.  In  addition,  if  the  mechanical  connection  of 
Kummer  and  Smale  is  considered,  our  formulation  for  nonholonomic  Lagrangian  systems 
specializes  to  the  one  in  Lagrangian  reduction  discovered  recently  by  Marsden  and 
Scheurle. 

We  end  this  chapter  by  introducing  future  research  relating  to  the  contents  of  this 
dissertation. 

Concerning  the  kinematic  control  problem,  more  general  methods  for  complete 
integrability  of  optimal  control  on  principal  bundles  with  connection  will  be  further 
investigated.  Some  methods  to  study  general  optimal  control  problems  with  nilpotent 
control  algebra  may  be  exploited  for  this  purpose  [4,12].  On  the  application  side,  the 
computation  of  geometric  phase  and  related  optimal  control  problems  for  more  concrete 
examples,  such  as  a  rigid  body  with  flexible  attachments,  will  be  studied. 
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In  reduction  theory  for  nonholonomic  systems  with  symmetry  we  discovered  in 
Chapter  V,  a  gyroscopic  type  of  force  appeared  in  the  reduced  dynamic  equations.  Unlike 
the  gyroscopic  force  coming  from  the  linear  term  of  of  the  Lagrangian  (or  internal  effect 
of  the  dynamics  of  the  system)  [50],  this  force  comes  from  the  nonholonomic  constraints 
(or  external  effect  of  the  environment).  Questions  about  the  control  of  this  type  of 
gyroscopic  systems  and  the  systems  which  also  include  the  internal  gyroscopic  effects 
will  be  answered  in  future  research. 

There  are  other  types  of  constrained  problems  in  analytical  mechanics,  such 
as  systems  with  Dirac  constraints  coming  from  degenerate  Lagrangian  [10,11]  and 
constrained  Hamiltonian  mechanical  systems,  in  which  the  constraints  are  characterized 
by  the  distributions  on  the  cotangent  bundle  of  configurations  space  [51].  How  to  deal 
with  reductions  for  the  systems  with  these  constraints  is  also  one  of  our  research  interests 
in  the  future. 

Finally,  we  note  that,  in  Chapter  V,  the  number  of  constraint  equations  in  our 
Constraint  Hypothesis  is  required  to  equal  the  dimension  of  the  structure  group  of  given 
principal  bundle.  How  to  relax  this  condition  will  also  be  investigated. 
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